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In the present note we shall give a generalization to functions in several 
variables, of the following classical theorem of Hadamard.” 


A. Let 
a(z) = 2s an2”, b(z) = > baz”. 


be any two power series converging in the neighborhood of the origin, and let (z) be 
the composite series 


a(z) = >, Gnbnz”. 
0 


If E,c'*, ape, Epe'*, OS o < 2m, are the vertices of the stars of a(z), b(z), m(z) 
respectively, then 


f, = g.lb. {Est,--0}. 


Osd<2r 


The reader will find a proof of this classical theorem in an annex to this note. 

The proof of the generalization will be simple enough but its formulation was 
by no means evident. In fact, the classical theorem itself will emanate in the 
following alternative form. 


I. Let 
a(x) = Lanz", b(y) = 2 bay” 


be any two power series converging in the neighborhood of the origin, and let x(z, y) 
be the composite series in two complex variables 


r(x, y) = > Anz” bay”. 
If 2’, y’ are any two numbers, then x(x, y) is analytic along the segment 
(p2®, py”), 0s p < g.l.b. {Eo(2")n-o(y")}, 
Osd<2r 
SS 


‘ National Research Fellow. ; 55-64 
‘J - Hadamard, ‘“‘Théoréme sur les séries entiéres,’’ Acta Mathematica, 22 (1898), ; 
P. Dienes, 7'he Taylor Series, Oxford (1931), esp. 346-348; S. Mandelbrojt, Les singularités 


oO analytiques representées par une Séries de Taylor, Fascicule 54, Paris, (1932), 
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where &(x°)e*, ne(y°)e"® are vertices of the stars of the functions a(x°z), b(y°z), these 
functions being considered as functions of the complex variable z. 


The transition from one to several variables will be based on the following key 
theorem of Hartogs.’ 


B. Let f(ai , +++ , Ly; 2) be analytic in z in a fixed domain T containing the origin 
for every set (a1, +++ , X,) from the region 


Ih(2°):|a. — ai | +--+» + la — a | <4, 


and analytic in (a1 , «++ , 432) inthe region (I s(a’); | 2 | < 170) for some r > 0. 
Then f(x: , +++ , 2y32) is analytic in (a, --+ , 432) in the region (I;(2’); 7). 


The corresponding generalization of Hurwitz’s theorem‘ will involve notions 


that were introduced recently.” 


Hadamard composite series. Let a(x, --- , 24), b(y:,--- , y,) be two power 
series, each in several complex variables. Grouping their terms in homogeneous 
polynomials, we put 


(1) a(t, +++ ,2) = Ly An(m, +++ 5%); bys, +++, Ye) = Ly Balyr, +++ 4). 
Our composite series will be a function in all occurring variables, 
(2) a(x; y) = dX Aan, +++, Ly) By, “++ Yy) 


We observe that this composition is invariant under any affine transformation 
in each group of variables. A composition of the coefficients themselves would 
fail in this and other respects. 


Circular regions. We shall first make a very simple statement concerning the 
regions of uniform convergence of the three series (1), (2). These regions will be 
denoted by G., G, G, respectively. Each of them is a circular region. A 
region in the (x, --- , z,)-space is circular if with any point (21, --- , %) it 
contains all the points (ka, --- , kx,), | k| <1. 





3 F. Hartogs, ‘‘Zur Theorie der analytischen Funktionen mehrerer unabhangiger Verind- 
erlichen, etc.,’’ Math. Annalen, 62 (1906), 1-88, esp. 12-20. 

‘A. Hurwitz, ‘Sur un theoréme de M. Hadamard,”’ Comptes Rendus, Paris, 128 (18%), 
350-353; S. Pincherle, ‘‘A priposita di un recente teorema del Sig. Hadamard,”’ Rendicontt 
del’ Acc. du Sc. de Bologn,a 3 (1898-99), 67-74; S. Mandelbrojt, loc. cit.*, 21-22. 

5 W. T. Martin, ‘‘Special regions of regularity of functions of several complex variables,” 
Proc. Nat. Acad. Sc., 22 (1936), 33-35. 

6 For a discussion of this and other fundamental notions see the tract by H. Behnke and 
P. Thullen, “Theorie der Funktionen mehrerer komplexen Verinderlichen,’’ Ergebnisse der 
Mathematik und ihrer Grenzgebiete, Berlin (1934), 34, 41-47. 
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G, is described by’ 
H(m1,---,2) <1, 
where 
(3) H.(a1 ecw Z,) ™ lim sup {lim sup | A,(a} wT 2,) ia 


The functions H, , Hy , H, satisfy the relation 
H,(x; y) = lim sup {lim sup | A,(2’)B,(y’) ord 


z’=zy'=y | n= 
S {H(t ++, %)Holy:, «++, y)}}, 
hence: 
Il. The composite function w(x; y) is certainly analytic within 
Hi(a1,--- ,%) Aly, ---, yr) < 1. 


This is a circular region contained in the circular region G, ; indeed, the point 
(t, +++, 2yj Yr,» *** , Yr) belongs to G, if numbers a, B with | aB| = 1 exist 
such that (aa , +++ , a&,) belongs to G, and (By: , --- , By,) belongs to G, . 


Stars. The star S; of f(a1, --- , 24) is defined as follows:* a point (z}, ---, 
1,) belongs to S; if f(x) is regular along the segment pz}, --- , pm ,0 S p <1. 
We shall need the following lemma which is based on Hartog’s key theorem. 

C. Let f(t: , +++ , 2.) be regular at the origin. Corresponding to any x}, --- , 
0 0 
t, let E(x), 


0 < (2°) < @, 


be the vertex of Sy in the direction ti,-++,2y, that is, let &(x°) be the greatest 
number such that f(a:,--+,2,4) is regular along the segment px\,--- , p%,, 
0<p< &2°). Then 
(4) £(x°) = lim inf £*(z) 

z=r9 





where §*(x) is the vertex on the positive real axis in the z-plane of the star of f(x1z --- , 
1,2), this function being considered as a function of z. 


For the proof of this lemma, let ¢’ denote any positive number less than 


és infz.20 &*(x) for fixed x},---,2%. There exists an e = ¢(#’; x) > 0 such 
that 

g(x) = ¢’ for 2, +++, 2, inI,(2’). 
SS 


* Compare W. T. Martin, ‘Special regions of regularity of functions of several complex 
variables.” To appear in Annals of Math. 

_*See footnote 5 and B, Almer “Sur quelques problémes de la théorie des fonctions analy- 
tiques de deux variables complexes,’’ Arkiv for Mathmatik, Astronomi och Fysik, 17 (1922-23). 
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There exists a 5, depending only on « and x such that for any z from I,(z') 
and any real y satisfying: | g | < 6 the new point ze” falls into I,(z°). 


The vertices in the z-plane of the star of S(aiz, --+ , 2,2) as a function of z are 
obviously £*(ze'*)e"’. Hence for z from I(x) the function f(az --- , z,2z) is 
regular in the sector 

lz| < &, |argz| <6. 


On the other hand, for x from I;(z) and | z| less some 7 > 0 the function 
f(xiz, «++ , 2,2) is regular in all variables. Defining T as the region consisting 
of the points of the sector 


le|< &, |argz| <6 
and of the points of the circle | z| < 7, we may now apply theorem B. We 
find that f(zz, ---,2,2) is analytic in (J,(x°); T). Hence f(x, ---,2,) is 
analytic along the segment pz; , --- , pt, forp < &’. Hence é’ < £2’). But?’ 


was any number smaller than the quantity on the right side of (4). Therefore 


t(x") > lim inf ¢*(z). 


But the greater sign cannot hold. Otherwise there would exist numbers ¢” 
and £’” and a I;(2’) such that 


t(2°) > &” > &" > £*(z) for x from I;(2°). 
Since f(z1 , --- , 2) is regular along the segment 
xi p, ee , LeP, p< E(x"), 
there exists and J,.(x°) such that f(a, --+ , 2,) is regular along the segments 
Lip, +++, Ly, get, x from To(2’). 
Hence there exist an ¢ and a g such that f(x2z, --- , 2,2) is regular for x from 


I.(z°) and z from the sector 
le|se", |argz| Sg. 


Let (2; , --- ,2,) be a point common to J;(2°) and I,(x°). By the definition of 
£*(x’) we have &*(x’) = ¢’”’ and this contradicts the assumption ¢’” > £*(z’) 
This completes the proof of lemma C and the theorem proper will follow easily. 


III. Let 
a(x) = > A,(n, ia” Ly), b(y) = > By, ms fy Yr) 


be any two power series, each in several complex variables, converging in the neighbor- 
hood of the origin, and let r(x; y) be the composite series in all occurring variables, 


a(x; y) = > Aaa, dive » Ty) By, 8) Yr) 
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If (a y’) is any composite point then a(x; y) ts analytic along the segment 
(px; py’), Os p' < gb. {t(2°e”) nly°e)} 
Osd<2r 
where . 
ae’ t(x°e*); ye nye) 
are vertices of the stars of the given functions a(x, --+ , 2), b(y1, --- , y,). 


The vertices of the stars of the functions 
a(xz) = > A,(x)z”, b(yz) = p> B,(x)z", 
as functions in z, may be written in the form 
e” E*(xe), e* n* (ye). 


By Hadamard’s classical theorem (see A) 


»» A,(x)Ba(y)w” 
is analytic along the segment 
0 < R(w) < glb. {E*(xe"”)n*(ye)}, ¥(w) = 0. 
Osd<2r 
Hence 
(xz; yz) = >, An(x)B,(y)e™ 
0 
is analytic along the segment 
z=, 0<p < glb. {¢*(xe"’)n*(ye”) }. 
Osd<2r 
Introducing the quantities 
e(a°; y’), o*(x; y) 
in analogy to ¢(x°), &*(x) and n(y°), n*(y) puts the last relation into the form 
[s*(x; y)I? = gdb. {e*(xe)n*(ye™)}. 
Osds2r 
Since &*(ze") is for fixed x the vertex distance of a function in one variable, it is 
continuous from below in the variable 8. The same holds for n*(ye”), and 


consequently for ¢*(xe")n*(ye"”). But such a function attains a minimum on a 
closed set. Therefore we have 


[c*(a; y) PP = E*(xe? =) nt (ye Po). 
There exists sequences 2” —> 2°; y"— y’, 3, = 3(z", y") — d such that 


lim inf E*(xe'? Sn *(ye Pe) = lim *(2"e"*) -lim n*(y” et), 
eal y n= n=o 
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In consequence of lemma C the last expression is 


> £(x°e°). (ye). 


But 
lim inf ¢*(z; y) = ¢(2°; y°), 
zi y=2; y? 
hence 
[s(2", y VP = e(a"e*)- (ye), 
Hence 


0 iW 


[G25 y VP 2 gl.b. ((a"el)-n(y'e™)} 
and this is precisely the statement of our theorem. 
The Hurwitz composition. With (1) we form 


(5) c(x;y) = DD C(x; y) 
where 


u(t; ») = 3 (7t) Aula) Bs). 


k=0 


If circular regions of convergence of our three series are characterized by 
H,(x) < 1, Ho(y) < 1, H-(2, y) < 1 then, compare (3), 


(6) H.(x;y) S H(z) + Avy). 
A more refined relation, analogous to (6), is 
(7) he(x; y) S ha(x) + holy) 


where ha(x), he(y), h-(x; y) describe the diagrams of the series a(x), b(y), ¢(z; y). 
The relation follows from 


log | A(zp) | | 
p 


z=z p=o 


ha(x°) = lim sup iim sup 


where 





A(a) = Dy Ax) 


0 


and from the analogous formulas for hi(y), h.(x; y), see footnote 5. 
If a(x"), B(y’), v(x"; y’) are the reciprocal values of the vertex distances of our 
three functions then 


(2°; y") S Lub. fa(z’), By’), a(a"e*e* + B(y’e Je} 


taken over such real values of g and y for which the expression in which they 
occur is positive. 
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A generalization. Let ki,---,k,;h,---,l, be any positive integers and 
suppose that the given expressions A,(z), B,(y) occurring in the Hadamard and 
Hurwitz composites are polynomials of weight n in the variables 2, --- , 2, ; 
yi,-**5Y», if these variables are given the weights k,,---,k,;h,---,l,. 
Then all assertions remain valid, if the circular regions of uniform convergence 
are the corresponding Cartan circular regions pertaining to the indices (k,, --- , 
i), (a--+,4) and (hi, --+,ks; i, ---,1) respectively and the stars and 
diagrams are similarly generalized, see footnote 5. 


An illustration. Let X,(x), p = 1, 2, --- denote a sequence of homogeneous 
polynomials in aj',---,2,”, let || X,|| be the maximum of | X,(z) | for 
in| +--+. +|a, | = 1; and let a, be a sequence of numbers. If the sequence 
|| X, || is bounded and the series 








i) 

y Lael 

p=1 | X> | 
is convergent, then the series 


i 2) 


p PETS... Oa 

p=1 1 — X,(2) 
is uniformly absolutely convergent in the neighborhood of the origin, and 
defines a function which will be our a(x). Its expansion >.> A,(x), where 


ie) 


A,(z) = D> {X,(zx)}" 


p=1 


is its Cartan diagonal expansion for the indices 


and converges in the Cartan circular region 
(8) | X,(z) | < 1, p=1,2,---, 


see footnote 7. It does not converge in a larger Cartan region since it has a 
singularity on the curve in which the analytic manifold 
3 0 0k 
tm = ay2",---,2, = we 


intersects the boundary of (8); in fact, the function in z 


Ok) Ap 
ae) = Lyrae 


has a singularity on the corresponding circle in the z-plane.’ Hence its Cartan 
region is 
ances 





* See, for example, A. Pringsheim, ‘‘Uber bemerkenswerte Singularitétenbildung bei 
yp Partialbruchreihen,” Sitzungsberichte der Bayerischen Akademie (1927), pp. 145- 
4, where also various references are given. 
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(9) lu.b. | X,(x) | < 1. 
‘Let 
. Ba = : eee 
b(y) = AC) oe 1— Y,(y) ) p> B,lys, ’ Yr) 


be a similar expression in the y-space with indices \i,---,,; h,---,],; 
h =1---A,. Its Cartan region is 

IY) | <1, beets. 
or 
(10) lu.b. | Ya(y) | < 1. 

q 


The Hurwitz composite c(x; y) is the expression 


co) 


> apBe 


pga l — X,(x) — Ya(y) 


If we add the assumption x = \ then it has properties analogous to those of 
a(x), b(y). In particular, its Cartan region is 


lu.b. | X,(x) + Y,(y)| < 1. 





In agreement with our previous assertion we have 


Lub. | X,(z) + Y.(y)| < Lup. | Xp(z) | + Lab. | Yo(y) |; 


the equality not holding as arule. The Hadamard composite is 


i) 


- ap Bg 
w(2; y) ig >» = X,(zx) Y,(y) ; 





the resulting Cartan region being 
(11) lu.b. | Xp(x) Y,(y) | < 1. 
Pq 


Thus in this case we even have the equality 
lu.b. | X,(z)¥¢(y) | = Lub. | X,(x) | - Lub. | Ye(y) |; 
P.qd Pp q@ 


in particular, the inequalities (9), (10) imply the inequality (11), in accordance 
with our general theorem. 

The stars of our functions do not lend themselves to a similarly simple analytic 
representation. But the result relating to stars can be partially illustrated. 
In general, the star contains the Cartan region; in fact the radial distance (2) 
of the Cartan region and the vertex distance £(x) of the star satisfy the relation 


p(x) = g.lb. &(xe**). 
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From the self explanatory argument 


r(z;y) = g.Lb. ¢(ze"*; ye’*) 
¢? 
= g.l.b. {e-Lb. e(xe™e™)n(ye"*e™)| 
¢g 0 
> g.l.b. {e-Lb. t(xe"’ e**) g.l.b. n(ye” e*)| 
8 ¢ ¢ 
= g-l.b. { p(xe*’)o(ye*”)} 


= p(x)o(y) 


it follows that a point on the boundary of the Cartan region of x(x; y) cannot be 
singular (that is a star vertex) unless it is a composite of such points of the 
functions a(x), b(y). And this statement can be verified for the Cartan regions 
(9), (10), (11). 

To return to the Hurwitz composite if the constants x, --- , ky jA1, °°: 4A» 
are all 1, then the indicator functions h(x), ho(y), h-(x; y) have the values 


ha(x) = Lu.b. {real part X,(z)} 
hy) = lub. {real part Y,(y)} 
h(x; y) = lu.b. {real part [X,(z) + Y-,(y)]}, 


and hence precisely 


h(x; y) = ha(x) + holy). 


Annex. ProofoftheoremA. Let e’’p(v) denote the star vertices of a function 
f(z) which is regular around the origin. Since f(z) is regular along the segment 
z= e"p,0 < p < p(¢) it follows that p(y), as a function of ¢, is continuous from 
below. 

Since both given functions £» , #,-» are continuous from below, their product, 
as a function in 8, is also continuous from below. Such a function attains its 
minimum on every closed set. Let an(y), Ba(y) be sequence of continuous 
(periodic) functions converging increasingly towards £, , 7, as n > ©. _ Then, 
for every y the sequence an(y)Ba(y — &) converges increasingly towards £ fj,» . 
Since the functions of the sequence are continuous and converge increasingly, 
and their limit function attains its minimum, we have 

lim g.l.b. {an(#)Baly — #)} = g.lb. {Es}. 
n=eo 0<0<27 Osd<2r 
The appromating functions an(¢), Ba(y) can be so chosen that the stars e” an(y), 


eit Ba(e) are bounded by rectifiable curves. Hence it is sufficient to prove the 
following statement. 
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If a(z) is analytic in the closed star e’* a(y) and b(z) in the closed star e** B(¢) and 
the star boundaries are rectifiable then x(z) is analytic in the interior of the com- 
posite star 


(12) z= e'’p 0<p< min {a(dB — 8)}. 
Osd<2r 


If ¢ is a point of the contour e~” /8(3),—we shall denote this contour by C,— 
and z is a point in (12) then the contour zt is interior to the star e“a(y). In 
fact, zt has the form e**” p/B(8) and p/8(#) is less than a(y — 8). Hence we 
can form the integral 


(13) | awo(t)4 


for all z in (12). Furthermore if z runs over a closed set S interior to (12) and t 
runs over C, then the totality of points éz is closed, and thus a closed subset of 
the interior of the star in which a(z) is being considered. Therefore for all 
t on C and z from S, a(tz) is uniformly continuous in (¢, z) and analytic in z. 
Hence the function (13) is analytic in (12). Let a denote the minimum of 
a(y) and 6 the minimum of B(v). Then a(éz) is analytic for |t| S a/ | z| and 
b(1/t) for |t| 21/8. Let | z| be sosmall that C lies in the annulus (1/8, a/ | z/). 
Then, in the integral (13), C can be replaced by any concentric circle in the 
annulus. On this circle we can substitute >.> ant"z” for a(tz) and ).o bat” 
for b(1/t), and thus we identify the integral (13) with the function 7(z). 


PrINcETON, N. J. 
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THE POLYNOMIAL F,(x) AND ITS RELATION TO OTHER FUNCTIONS 


By H. Bateman 
(Received December 14, 1936) 


1. The function U,(z), which was used in a recent paper,’ may be defined by 
means of the equation 


d Ft ond —2z 
(1) (4 1) exp (—te-*) = U,(te™™) 
which is easily generalized. We may, for instance, write 
(2) r.( 4. - 1) r{= 1) exp (—te™*) = Umn(te™) 
dx dx a 
and obtain a power series 
(3) Unn(e) = LS p28 — 1)F,(—28 — 1). 
s=0 . 
There is a similar generalization with any number of factors of type 
d 
r.(4 —T 1) 
The function Um,(z) is related to the function Pm,,(u) used in a former 
paper,’ in fact 


1 zr—1 on 
(4 ———— e.a4 = = * mn 
) pale (24) | € (t) dt R(x) > 0 
and we may write for |z| > 1, 
‘ 1 zr—1 = ae: 
o Laat - Berne ann 
5 ipaq 2X ( Ya" F,»(—2s — 1)F,(—2s — 1), 
while for |x| < 1 
(6) _! p (254) ~ > (-2) 
ipa lel 2 ( x)’ F,,(2s + 1)F,(2s + 1). 
Since F,(— 2s — 1) > 1 it follows from (5) that when z > 1, P»n,,(z) > 1. 


2. By means of equation (1) or the relation U,(¢) = exp (—2)Z,(¢), where 
Z,(t) is the polynomial F(—n, n + 1; 1, 1; ¢) it is easily shown that 
Ss 

i Bateman, Duke Mathematical Journal, vol. 2 (1936) 569-577. 

H. Bateman, Annals of Mathematics, vol. 35 (1934) 767-775. 
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(7) [ r’U,(r) dr = T(v)F,(2r — 1). 

The associated equation 

(8) I a Umn(t)dr = T(r) Fm(2v — 1)F,(2v — 1) 
may be proved by means of the equation 

(9) Vaile) 8? % tans? 


s=0 


in which the coefficients @m,n,, are those of the expansion 


m+n 


(10) F,,(2v — 1)F,(2v — 1) = p> Om.n.evv + 1)--> (v+s8—1) 


which gives (9) when it is used to evaluate the left hand side of (2). The 
relation (9) shows clearly the nature of the entire function U,,,(z) and a more 
general function such as Uim,(z) is readily seen to be also equal to a polynomial 





multiplied by e *. 
From (6) and (7) we infer that 

6 _ (—)* (: _ *) 
(11) | U»(ar)U,(7) dr = i + ‘ Peun z+1 ° 
A proof of this equation may be obtained from 

—zr 1 xz-—l 

(12) [ é U,(r) dr = i+yz Fe (53) 
by writing e ~~” in place of x and using (1). We find in a similar way that 

‘ _ (-) (: — *) 
(13) i] Ui(27)Umn(r) dr = l+a Pian z+1 
where 


(14) whoPiuc. taba oh (2) PF, (¢) F, (<) me 


Also, if 


d d d —te~2z —2z 


we have 


~— 1 af 
(16) [ Uimals)de = pt Pinn(2=*) 


and so on. 
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3. The function V,(z), orthogonal to U,(z), may be defined by means of the 
equation 





d 9 eo oe ” 
(17) r(¢ 1) [ exp [—t(r — 1)e™*] == V,.(te**) 
for the left hand side is equal to 
m rr ae ” U,(s)ds 
(18) i Ualttr — eS = | phe 


and so the present definition agrees with the former definition. 
It is readily seen that 


mt) [ ervsnar = 2 0.(244), R(z) > 0 





and so, on replacing x by e ” and using (1), we find that 


2 x+ 
“a 1 n..(2+4), R(x) > 0. 





(20) | Un(ar)Va(r)dr = (—)" 


x 


The orthogonal relation is now obtained by making z — | and using a result 
obtained in”. 
Defining the function Vma(z) by the equation 


(21) Vinn(Z) = i] cut t 





or the equivalent equation 


(22) r.(¢ _ 1) r.(4 _ 1) J exp [—t(r — ery S = Vinn(te”) 


we have also 


mi Mt. 2 z+1 
(23) i] é mn(T) dr = + omy | 0n..(2 +7) 


x 


The last equation gives a rule for the determination of the coefficient a, in an 
expansion 


(24) f® = Xa, V,(0), t>0. 
n=0 
After some reductions the rule is found to be 
(25) G, = (2n + 1) | o‘at| Fa(1 + 2t 2) 10 
0 


and is an alternative to the rule 


(26 + h+) | ” Us at. 
0 
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In the particular case when f(t) = t’ both rules give [for R(v) > 0] 


| (27) dn = (2n + 1)T(v)F,(2» — 1). 
4. Defining the function U™'"(z) by the relation 
(28) Faa(Z — 1) "tid = U™ "(te **) 
dz 


in which F,,,,(x) is the function introduced in” we have the power series 








(29) ume) = SP ry g(—28 = 1) 
s=0 ° 
and the definite integral 


ees Oe Sa z—1 t—1 
(30) | e”U™""(r)dr = i 4_p.(2- +) P(2= *) 


This indicates that for |x| > 1 


ae Pa(= * 1) Pa(2 : ') = 0 (2 Fao —28 = 1) 


l+z2 x+1 t+1 s=0 


and there is a corresponding expansion for |2|< 1. We have also 


























(32) | r U""(r)dr = T'(v)F'm,n(2v — 1). 
0 
We may further define a function V”’"(z) by the equation 
ad ae ri —t(r—1) e— 22 dr a (m,n) (4-22 
(33) Pah 1) [ id 7*= Vee), 
it has the property 
- 2 de yon ™ 1 (: 4+ 1 x ) > 
— [ (r)de roi Whe) eS 1 “e 
A generating function for U”’"(t) is readily found from (30), we have for | y| <1, 
lze|< 1 
Dy") = ; e ' Io(bt), 
m=) n=0 (1 — y)(1 — 2) 
(35) : p 
where a+b = (14H), (a0) = (1+4). 
l-y 1-—z 





If U""(t) = e‘'Z™"(t), the exponential factor in the generating function of 
Z”'"(t) is 


_ yt Qa | 
A exp| eed 


and it is readily seen that Z”'"(£) is a polynomial of degree m + n int. 

















ion of 





POLYNOMIAL F(x) AND ITS RELATION TO OTHER FUNCTIONS 307 


Putting y = z in the generating function we have the expansion 
1 a (=) ‘ = n rmn* 
(36) aa = 27 T.W) 
where 


* ‘ n—8,8 -t ‘ (—4t)"( 1)! 
Ta) = UN) =e De Tee Ij! 


n+1-, _ 4t\" _3. 2\ dz 
ani Le :) (n+ aga’, 


and C is a simple contour enclosing the origin. It is readily seen that 





(37) 





1 sin(n+ 1)6 





‘ —zt pn* — 
(38) | eT, (dt = fest ae x > 0, 
where 
x—l1 
cos 9 = F411" 


The asterisk in 7's (#) has been introduced to distinguish it from the Tchebycheff 
Polynomial 7',(¢). 

On account of the orthogonal property of the Legendre polynomial it follows 
from (30) that 


i] Vor) UU" "(r)dr = 0 if mAn 
0 


(39) 
1 


1 , ow ey ew 
2r+1 A, 





(40) | V.(r)U"'"(r) dr = 


2;p=m+n-+s, s!A, = 1-3-5 --- (2s — 1) 





may be deduced from the well known expansion (for m < n) 


(41) P,.(z)P.(z) = > C.Pnan2e(2), 


(2m at 2n site 2s + |} = [oo wo - 


and the related expansion 


(42) U™"(t) = > C. Um +n—2al(t). 
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The relations (37) and (39) give 


[ Vo(t)T2(t)dt = (1/n +1) for neven 
(43) a 


= 0 for nodd. 


This is a particular case of a more general relation 





(44) | ; Vin(t)T(t)dt = 4 l , P,,(cos 0) sin (n + 1)6d0 


which may be deduced from (19). 
(39) and (35) give the relation 


/ ° e ™ Iy(bt) Vo(t) dt 





(r + 1)(s +1) + V(r’ — 1)(s8’ -1) 
(r+ 1)(s + 1) — V(r? — 1)(8 - 1) 


r=atb, s'=a-—b, a2b20. 


(45) 





= 2[(r? — 1)(s* — 1)}* log 


A series may be obtained for the corresponding integral with V,(t) in place of 
Vo(t) but it is rather complicated. 
It may be shown by means of (30) that 


(46) U™"(t) = ae _ aa + an — | 





where 
c, = m(m + 1) + n(n + 1). 
Co = (m — 1)m(m + 1)(m + 2) + 4m(m 4+ 1)n(n + 1) + (n - 1) 
(47) n(n + 1)(n + 2). 
cs = (m — 2)(m — 1) m(m + 1)(m + 2)(m + 8) + 9(m — 1) m(m +1) 
(m + 2) n(n + 1) + 9(m(m + 1)(n® — n)(m + 1)(n + 2) + (n- 2) 
(n — 1) n(n + 1)(n + 2)(n + 8). 


The related expansion for F'»,,(—1 —2z) is 








relations 


x x(x — 1) a(zx—1)(4@—-2), |. 
j (48) Frmn(—1 — 2x) = 1+ a ap + “ep + Cs (313 , ii 
It should be mentioned that the functions Z,(t), T*(t) satisfy the recurrence 
: 


Tans 





(49) (4n + 2) [ Z,(s)ds = (2n + 1)Z,(t) — (nm + 1)Zayilt) — nZn-lt) 













| 
— 
~— 


| 


l 
— 
— 


wce of 


+1) 
, — 2) 


rrence 
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(0) ($+ 1)irta@) + Pra) = o($ - 1) 780 


F(T) - Tra] 
= (n + 1)Trai(t) + (2t — 2n — 2)TR@) + (n +1 + 2)TRAW). 


When the Neumann series 
(52) M(2't) = 2° 2 (4n + 2)Za(t)Jonss() 


is arranged according to powers of z it is found that 


68) MG) = SEN = st, 0,0) 


in the notation of Wrinch.® This is an entire function of z*t and so the Neumann 
series may be expected to converge for all finite values of z. The associated 
power series is 


2n 


— 2 
2 gman 2 
and this must represent an entire function of x. If we retain only the highest 
power of ¢ in Z,(¢) so as to obtain a function of 2°t we get simply the power 
series for M(2"t) as associated series. 
The series (46) suggests the formula 


(54) Zt) =G I M(2’ t) Jony(x) dz, 

0 
the general value of the integral being obtained by introducing an exponential 
factor e“ under the integral sign and passing to the limit u — 0. If it is 


calculated by expanding the function M(z’t) in a power series and associating 
the value 2°"I'(n + m + 1)/I'(n — m + 1) with the divergent integral 


i] . xo Jong (x) dx 


the power series for Z,(t) is obtained. 


—_—. 





*D. Wrinch, Quart. J. 50 (1927) 204-224. An asymptotic expansion for M(z) may be 
derived from the general formulae of Fox, Proc. London Math. Soc. (2) 27 (1928) 389-400. 
The function M is related also to functions used by L. Pochhammer, Math. Ann. 41 (1893) 
197-218 and A. R. Forsyth, Quart. J. 50 (1924) 97-148. Forsyth gives the asymptotic value 


ss 0, iy ~ A exp ($2) cos ($4/3-2t — 4) where A is a constant which he leaves unde- 
ermined, 











eee tine ee Te ee 
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It should be noticed that the integral 


I Jony1(us)Zn(u) du 
0 

which is obtained from (54) by Hankel’s inversion formula, preceded by the 
change of variables s = us, where s’t = 1, is convergent but its value, instead of 
being s’M(s°) is s ’M,(s’) where M,(s’) is the polynomial of degree 2n obtained 
by truncating the series for M(s). We may thus write 


(55) M(x) = lim Z(xt~*) Jonzr(t) dt. 


no 0 


It may be recalled that Forsyth’s analysis shows that the roots of the function 
f(z, 0, 0) are all real and negative, the numerically smallest roots being (—1.058)* 
(—2.234)*, (—3.437)*, (—4.653)* approximately, while for large values of n, 
the n* root z, is given approximately by the formula 


r V/3 3 
(56) Zn = 6561. (1 = 6n) m 


A further property of the function M is expressed by the equation 


oo hr 
i] M(2* tet” ‘dt = 2 sin (vir) [ Jo(x sin 6) (tan 6)” ‘dé 
(57) 0 7 0 
0<rv<l. 
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ON THE ORDER AND TYPE OF INTEGRAL FUNCTIONS BOUNDED 
AT A SEQUENCE OF POINTS 


By V. Ganaratur IYER 
(Received September 1, 1936) 
1. Introduction. Let f(z) be an integral function. Let 


M(r, f) = max: | f(z) |. 


The order p and the type k(f) of f(z) are defined by the relations 





p= mg PEMD, ap) = eg EMD, 
r7w og r ro r? 


The following theorem has been proved by Miss Cartwright :’ 


TuroreM (A): Let f(z) be an integral function of order one and type k(f) < m. 
Let f(tn) = O11), n = 1,2,3---. Then f(z) is bounded along the whole of the 
real axis. 

1.1. The following result is an easy deduction from theorem (A): 


THEOREM (1): Let f(z) be an integral function of order one and type less than x. 
Let f(+n) = O(1), f(4in) = O(1). Then f(z) will reduce to a constant. 

Proor: From theorem (A), we deduce that f(z) is bounded along the whole 
of the real and imaginary axes. By applying a classical theorem of Phragmen 
and Lindelof,” we conclude that f(z) is bounded in the whole plane and therefore 
is a constant. 

1.2. Theorem (1) states that the type of an integral function of order one 
bounded at the points z = -kn, z = -in cannot be less than 7 unless it reduces 
toaconstant. This is a special solution of the general problem of determining 
the order and type of an integral function taking a set of bounded values at a 
prescribed sequence of points. The author has considered elsewhere’ some cases 
of this general problem where the points at which the function of order one is 
supposed to be bounded are distributed along a single line. The results are 
correspondingly less precise than those obtained in this paper where the function 
is supposed to be bounded at points lying on two lines as in theorem (1). Similar 
results are obtained when 0 < p < 1, by supposing that the function is bounded 


ee 


: ‘On certain integral functions of order one,’”’ Quar. Jour. Math., Oxford Series, Vol. 7, 
No. 25, (1936), 46-55. 
* See, G. Valiron, ‘Lectures on Integral Functions,’’ p. 125. 
*“On integral functions of order one and finite type,” Jour. Ind. Math. Soc., New 
Series, Vol. 2,No.1. We shall refer to this paper as I in the sequel. 
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at sequences of points lying on both halves of a line instead of on one half only 
as is the case considered in another paper* by the author. The case p > 1 is 
not treated since the analogous results could be obtained by the process employed 
in §§3-3.5 of II. Theorem (A) itself does not seem capable of generalisation 
since it depends essentially on the fact that sin rz, the canonical product for the 
sequence {-:n}, is bounded along the whole of the real axis. This is not true 
always even when we restrict ourselves to sub-sequences of integers. For 
instance, the canonical product for {-:A,} where {A,} is composed of the odd 
integers along with the even squares, is 


sin 34 +/z sin }i +/z 
inV2 Ariz 


which is not bounded along the real axis. 

1.3. Let {z,} be a distinct sequence of complex numbers such that | z, | > ©. 
We shall speak of its exponent of convergence p as its order. It is supposed 
that 0 < p < «. Let o(z) be the canonical product with simple zeroes at 
2 = 2,. It is well-known that the order of o(z) is p. We write, for shortness, 


(1) Pl, o] = 2 £1 


n=l oa’ (Zn) Z—2n , 





COs $72: 








where f(z) is any integral function and the series on the right side of (1) converges 
absolutely and uniformly for all z except at z = z, ; this happens to be the case 
in all applications considered below. P[f, o] represents a meromorphic function 
with simple poles at z = z, and having the same principal part as the function 
S(z)/o(z). Let each term in (1) be replaced by its modulus. The resulting 
expression is denoted by P[f, o]. 

1.4. Let O < Ai < A2--- < An < --- be a sequence tending to infinity whose 
order is one. Let n(r) denote the number of these \, not exceeding r. The 
sequence {,} is said to be measurable and of density D when lim,.,,, n(r)/r = D. 
It is easy to see that >, 1/A2, converges in this case. Let 


(2) : ‘a (2 : *). 


n=1 n 


Let 5 be the index of condensation’ of {\,}. The following results are known’ 
and will be required in the sequel: 





‘“‘On integral functions of finite order bounded at a sequence of points,’”’ Ibid., No. 2. 
This paper will be referred to as II in the sequel. . 

‘ For the definition of the index of condensation, see, Bernstein, ‘‘Séries de Dirichlet,” 
Borel Tracts, 1933. When {Aj} is measurable the index is given by the relation (4). When 
lim, _,.(An41 — An) > 0 the index is zero. For the properties quoted, see pp. 279-203 of 
Bernstein’s treatise. 
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ily TaroreM (B): Letz = re”. The following relations are valid: 
i iv 
- jim 198 | Atv |. xD] sin y| 
on (3) 
he tie log M(r, A) = <D 
"ue r+  - 
4 where y is not a multiple of x. Also 
1 


—— 1 


1.5. Let {An} be a sequence of order p, 0 < p < 1, and let lim,_.. n(r)/r? = D. 
We say that {X,} is measurable of order p and density D. Let 


. 6) AG) = If (1 - 2), 


n=1 n 
an Here, we define the index of condensation of {\,,} by the relation 
28S —— l 1 
6 oii — lox’ ——-,. 

) aoe |AOn)| 

It may be noted, in passing, that the index as defined by (6) can be negative 

which cannot happen for the index as understood in §1.4, for p = 1 (see §5 below). 
ges The following properties of A(z) given by (5) are known:® 
“a THEeorEM (C): A(z) given by (5) satisfies the relations: 
m lim log M(r, A) = mD cosec r° 
Ing (7) re r? 

iy 

\OSe lim log | A (re) | = 7D cosec 7’ cos p(x _ ly) 
The re ” 
D. where y is not a multiple of 2x. 





2. In this section, we are concerned with functions of order one. We prove ie 
the following i 


THEOREM (2): Let {An} and {u,} be two measurable sequences of order one and 
wn’ densities D, and D, respectively. Let their indices of condensation be zero. Let 
f(z) be an integral function of order one such that f(+2n) = O(1), f(tun) = O(1). 
Then, either k(f) = xD = min: (wD, , xD,) or else f(z) reduces to a constant. 
Proor: If D = 0 the result to be proved is trivial. So we suppose D > 0. 


0. 2. We assume k(f) < #D and show that f(z) is a constant. Let A(z) and B(z) 
7 be the canonical products for {+A,} and {-iy,} respectively. Let us write 
Then IT sae. 

3 of *See, Paley and Wiener, ‘‘Fourier Transforms in the Complex Domain,” Amer. Math. 


Soc. Collo. Publications, XIX, p. 79. 


a 
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An = —-ny Hn = —M-n, forconvenience. Let F(z) = 2” f(z),m=0,1,2,---, 
We have 


. , Frn(An) 1 . , Fy, (ipn) 1 
P[Fn, AB] = ; : 7 
(8) Pm, AB) = 2’ arc) BO) 2— Ie + Lm AGua)B Gm) FI’ 
where the accent, as usual, indicates that n = 0 is to be omitted. The indices of 
condensation being zero, we get by (3) and (4) 























eee | et lh ae engl, 
(9) 2.5)"%\aGanoo |= += 
a 1 Fn(ipn) 
i = ae | Se © He 
ate [itn] | Aun) B’ pt) si “ 


We see from (9) that (8) converges absolutely and uniformly for all z except at 
the zeroes of A(z) B(z). Put 


Fn(z) 
A(z) B(z) 
By using the methods of I, §§4-4.3, we conclude that H,,(z) is an integral func- 


tion of order one. Also it is easy to see, by using (9), that on any line 
amp.(z) = 6, not a multiple of 37, 


(11) lim P[F,,, AB] = 0. 


|e|-7 ° 


(10) H,,(z) = — P\|F,,, AB). 


On the other hand, on amp.(z) = @, not a multiple of $7, we get 
(12) lim — os 
|z|—>00 |z | 4(z)B(z) 


in virtue of (3). Now D, | sin @| + D, | cos 6| attains the maximum V (Di + D;) 
on the four lines 6 = +a, 6 = +(r — a), where 0 < a < 3x. Hence on these 
lines 


log < k(f) — r[D,| sin 0| + D,| cos 6]] 














F,,(z) 
(13 lim | see) oe 
) iz | A(z) B() 
since k(f) < #D < r~/(Dx + Di). Combining (11) and (13), we find 
lim H,(z) =0 . 
|zj—0 


along these. four lines. Using the Phragmen-Lindelof theorem mentioned in 
§1.1, we conclude that H,,(z) = 0 so that 


oe)... mom 
Now, let g(z) = co + c1z + coz’ + --- be any integral function of order one and 


type k(g) < 7D. Then 


ZL ttatt oe 





Li 


or 








of 
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where 8 < wD. Hence, in virtue of (9), the double series 


> Plenz”f, AB] 


(m,n) 


converges. So we get’ by (14), 
Se) Ply, AB 


(15) A@B) 

In particular, we can take g(z) = z"f(z), m = 0,1, --- , so that 
Or 

(16) A(z)B(2) = Piz f, AB). 


Starting from (16) we can repeat the argument and conclude that 
LO |= 


for all p = 0,1, 2,---. Since f(z) is bounded at the zeroes of A(z)B(z), it 
follows from (9) that the double series 


b pif : AB 
(pn) Pp: 
converges so that (17) gives 

ef? 
A(z) B(z) 


Since e’ = O(1) at the zeroes of A(z)B(z), the methods of I §§4~4.3, enable us 
to conclude that 


(18) = Pie’, AB]. 


e! = A(z)B(z) P{e’, AB] 


is an integral function of order one. This shows that f(z) is a polynomial of 
degree one at most. But f(z) is bounded at the zeroes of A(z)B(z). Hence 
f(z) reduces to a constant. 

2.1. By putting a more stringent hypothesis on the values of f(z) at the zeroes 
of A(z)B(z), we can obtain more precise results. We, first, prove 


THEOREM (3): Let {An} and {un} be two measurable sequences of order one and 
densities D, and D,. Let their indices of condensation be 6, and 5,. Let f(z) be a 
function of order one such that 


lim Ed log | f(+An) | = —d < 0. 
(19) no Ayn 
lim . log | f(ttyn)| = —dy < 0. 


Let xD = min:[r-V/(D? + D2), rD, + d, — 8, 7D, +d, — &). Thenk(f) = xD 


or else f(z) is identically zero. 

















i Sey 


— 
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Proor: In virtue of (19) we get in the notation of §2, 











22, (51? aa Pe 
(20) ‘ Flip.) 

a m\Ubn cae bon 

dim, 7] °8 |AGu,)B' Gua | =~ ("Pa + ds — 4). 








The case D = 0 is trivial. So we suppose D > 0. Then (20) shows that 
P[F,, AB] is absolutely and uniformly convergent except at the zeroes of 
A(z) B(z) and we can repeat the argument of §2 to show that 


F,,(z) 2” f(z) ” 

(21) T@B@ = A@B@ =P PL AF 
when k(f) < x D, if we remember that (13) is valid if only k(f) < r -V (Di + D}), 
If g(z) be a function of order one and type k(g) < x D, the relation (15) and hence 
all the relations that follow hold good so that we conclude that f(z) is a constant. 
By (19) this constant is zero. 

2.2. An interesting case is when d, = d, = «©. Here we can suppose that 
5, and 6, are merely finite. So we get 





TuroreM (4): Let {X,} and {u,} be two measurable sequences of order one and 
densities D, and D,. Let their indices of condensation be finite. Let 


lim, ~ log | f(s) | = lim ~ log | f(binn) | = —@, 
neo n n—-2 Mn 


where f(z) is an integral function of order one. Then k(f) = mV (D\ + Di) or 
else f(z) is identically zero. 
2.3. Taking A, = un = n, we have 6, = 6, = 0 and so we can state 


THEOREM (4-a): Let f(z) be an integral function of order one such that 
lim * log | f(+n) | = lim * log | f(4in)| = —o. 


Then the type of f(z) cannot be less than + V2 unless f(z) is identically zero. In 
particular, this is true when | f(+:n)| < &*"!*" and | f(in) | < 6%" '°*" where 
a and B are positive. 

2.4. Remark: Theorem (4-a) shows that the result of theorem (4) is a best 
possible one since the function (sin rz sinh 2z)/z” which vanishes at z = =, 
z= tin, n = 1, 2, --- satisfies the condition of theorem (4) and has precisely 
x V2 for its type. 

2.5. The significance of these theorems can be best realised by taking some 
concrete examples. Let {usn} and {vin}, n = 1, 2, --- be two bounded se 
quences, not all the terms being equal to a fixed number. Then we can assert 
that the function 








~ (—fy"«, “Yt ee ee | 


n= Sinh rn fa pe 


sin rz sinh rz | ~~ : 
nao 27Sinh rn z — 7n 
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is of order one and type 2. The accent, as usual, indicates that n = 0 is to be 








omitted. If vin = Vin = eS" n = 1,2, --- we can assert that the function 
‘ — (—1)"e"'*! log || 1 ae (—1)"e7!*! toe In 1 
ne | sinh mn z—n ig 2s tsinhan z—in 


is of order one and type = x V2. 


3. We treat the cases 0 < p S 3 and 3 < p < 1 separately. We first prove 


TuroreM (5): Let {rx} and {un} be two measurable sequences of order p, 
0 <p < 3, and densities D, and D,. Let their indices of condensation be zero or 
negative. Let f(z) be a function of order p such that f(n) = O(1), f(—un) = O(1) 
then k(f) 2 xD cosec xp where D = min : (Dj, D,) or else f(z) is a constant. 
Proor: We suppose D > Osince D = Oistrivial. Let 


A(z) = II (1 ~ £) 


n=) n 


he II (1 +2) 


(22) 


n=1 


We suppose k(f) < aD cosec zp and show that f(z) reduces to a constant. 
Let, as before, F,,(z) = z”f(z). We have 


-.  F'in(Xn) 1 ~ Fn(—n) 1 
On PU. ARs [= 'Q.)BO.) 2 =, + 2 AmB) 24 =| 








Now, by (6) and (7), 











a 1 Fn(Xn) < 
lim i log A(n)BQn) s nD, cosec 7p = — aD cosec Tp 
- (— pn) 
—— 1 F — pe 
lim —; lo = - < —rD, cosec rp S —7D cosec rp 
note |A(—Hn)B(—mn) | 4 








since the indices of condengation are zero. Hence P[F,,, AB] converges abso- 
lutely and uniformly except at the zeroes of A(z)B(z). Let 


Fn(z) 
A(z)B(z) 


By using the methods of II, §§2.31-2.32, we conclude that H,,(z) is an integral 
function of order p. On amp. (z) = 6, not a multiple of 7, we have, from (23) 
and (7), 


(25) Hn(z) = ~ P[Fn, AB). 





(26) lim P[F,, AB] = 0 


|2|—+20 


and 


(27) iim sie F,,(z) 


ie l2P ”|A@)B@) < K(f) — x cosec rp[D, cos p(x — |6|) + D, cos p| |]. 
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The expression D, cos p(r —|6|) + D, cos p|@| attains the maximum 
V (Di + Di, + 2D,D, cos rp) onaline@=a,0<a<-7. Since k(f) < rD cosec 
mp, (27) shows that on 6 = a 

_ Fa (z) 
28 lim ———— = 0. 
(28) || 20 A(z) B(z) 
If 0 < p < 3, we conclude from (26), (28) and the Phragmen-Lindelof theorem 
that H,,(z) = 0. If p = 3, (28) is true on lines @ = a + 7, where 7 > Ojs 
sufficiently small. We can again apply the Phragmen-Lindelof theorem and 
deduce that H,,(z) = 0. So we get 


2” f(z) - 
= AB]. 
With (29) as the starting point we can repeat the argument of §2 and show that 
f(z) reduces to a constant. 
3.1. The analogue of theorem (4) runs as follows and can be proved similarly: 


TuHeorEeM (6): Let {rn} and {un} be two measurable sequences of order p, 
0 < p S 3 and densities D, and D,. Let their indices of condensation be finite. 
Let f(z) be an integral function of order p such that 


lim i 


nec vn 


log | f(x) | = lim 7 log | (—u) hay 


Then k(f) = m cosee rp V(Dx + Di + 2D, D, cos xp) or else f(z) is identically 
zero. 


4. We now consider the case } < p <1. Here we prove 


THEOREM (7): Let {r,} and {un} be as in theorem (5), the order p being such that 
4<p<l. Letf(z) be afunction of order p, such that f(An) = O(1), f(— un) = O(1). 
Then k(f) 2 x D, where D is the greater of the two numbers min : (Dy, D, cosec Tp); 
and min : (D,, D) cosec rp), or else f(z) reduces to a constant. 

Proor: Suppose k(f) < aD. In the notation of §3, we find that the relations 
(22) to (26) are valid. It is in the estimation of 


D, cos p(x — | 6|) + D, cos p| 6 | 


that difficulties are introduced owing to the fact that cos mp is negative. But in 
any case, the expression x cosec xp [D, cos p(x — | 6|) + D, cos p| 4 |] takes 
the value 7D, for 6 = + 32/p and wD, for 6 = + (x — 3x/p). If we denote 
by a the angle }7/p or  — 42/p according as D, or D, is the greater, we find 
that (28) is valid on the lines @ = + a + 7 where 7 > 0 is sufficiently small. 
We can now apply the Pragmen-Lindelof theorem and conclude that (29) is 
true, from which we deduce that f(z) is a constant. 
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4.1. Similarly we prove 


TuroreM (8): When {An} and {un} are supposed to have merely finite indices of 
condensation, the conclusion of theorem (7) is true if f(z) is such that 


lim 1 hog | f(a) | = lim = log, |f(—mn)| = —o, 


no An 
4.2. When the densities are equal, D, = D, = D, say, the expression 
D{cos p(x — | 6|) + cos p| 0|] 


attains its maximum value 2D cos 37p on the four lines @ = +a, 6 = +(r — a) 
where) < © <a. Therefore if 8(o) = min : (cosec zp, cosec }xp), we can state 


TuroreM (9): Let {An} and {un} be two measurable sequences of order p, 
1 <p <1 and equal density D. Let their indices of condensation be zero or nega- 
tive. Let f(z) be of order p such that f(\xn) = O(1), f(— un) = O(1). Then 
k(f) 2  D B(p) or else f(z) is a constant. If the indices are supposed merely 
finite, the conclusion remains true provided 


lim 5 


no ne 


oh rg 
log | fx) | = lim =] log | f(—ms)| = —@. 


4.3. The difference in the results for 0 < p S 3 and 3 < p < 1 arises from the 
fact that cos rp = 0 in one case and cos zp < 0 in the other case. The maximum 
Vv (Di + Dj + 2D, D, cos rp) of D, cos p(x — ||) + D, cos p | | can be equal to 
D, sin rp if D, + D, cos xp = 0 which can always be verified by adjusting the 
ratio D,/D,. If also 2 < p < 1, the conclusion of theorem (7) is a best possible 
one, since in this case the the function A (z)B(z) itself has the type rD,. There- 
fore when 3} < p < 1 the ratio of the densities play a large part in the statement 
of the conclusions of theorem (7) and (8) but in no case can the type go below 
the limit given in these theorems. 


5. As an illustration, let Xn = un = n’’,0 <p <1. It is shown in II, 
$4.2, that 


1 
A’(n1/?) 
Therefore we can state the following theorems: 


TaEorem (5-a): If f(z) is a function of order p, 0 < p < 3, and f(+n'”) = 
O(1), then k(f) = x cosec xp unless f(z) is a constant. 


lim ! og = —7 cot mp. 


no 








TuEoreM (6-a): If f(z) is of order p,0 < p S 4, and 
lima (1/n) log | f(n"”) | = — &, 


the type k(f) = w cosee 3p, or else f(z) is identically zero. 
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TueEoreM (9-a): If f(z) is of order p, } < p < land 
lima (1/n) log | f(-tn"”) | = — @, 


then k(f) = w min : (cosec zp, cosec $7), or else f(z) = 0. 

5.1. Conclusion: In theorems (5)-(8), we can take into account the actual 
values of the indices of condensation especially when these are negative as in 
theorem (5-a) when 0 < p < 4. These results show that the type of a function 
bounded at a sequence of points of the type considered in this paper, is, in 
general, controlled by the densities and the indices of condensation of the se- 
quences in question. In connection with theorem (5-a), it might be of interest 
to mention that when it is merely assumed that f(n”") = O(1) instead of 
f(+n"") = O(1) as in the theorem mentioned, it can be proved that f(z) 
reduces to a constant if k(f) < cot mp. 


Mapras University, INp1aA. 
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MAXIMAL CONVERGENCE OF SEQUENCES OF HARMONIC 
POLYNOMIALS 


By J. L. Wausx 
(Received August 19, 1936) 


1. Introduction. The concept of maximal convergence of sequences of 
polynomials in the complex variable z exerts a unifying influence in the entire 
theory of approximation to an analytic function by polynomials. The property 
of maximal convergence is shared by all sequences of polynomials of best 
approximation as measured in a large variety of ways, and is shared likewise by 
many sequences of polynomials defined by interpolation. 

The general theory of approximation of harmonic functions in two variables 
by harmonic polynomials is analogous to the theory of approximation of analytic 
functions by polynomials in z, and has received some attention in recent years.’ 
It is to be expected that the concept of maximal convergence would unify also 
the theory of approximation (especially best approximation) by harmonic 
polynomials, and it is the object of the present paper to show how that expecta- 
tion can be fulfilled. As definite advances made in the present paper, we 
mention the following: introduction and study of the concept of maximal 
convergence of harmonic polynomials, in such detail that by analogy new . 
results are suggested or proved even for maximal sequences of polynomials in z; 
detailed study of degree of approximation and best approximation by harmonic 
polynomials on a point set whose complement is multiply connected (hitherto 
untreated,—older methods are inapplicable); the obtaining of new significant 
results even for approximation in the sense of least squares as measured by 
integration over a single Jordan curve. 

This new study of maximal convergence of sequences of harmonic polynomials 
is broadly analogous to the older study of maximal convergence of sequences of 
polynomials in z, and is here to be derived from the older study. Nevertheless 
there are many differences both of method and result, so that the new study is by 
no means a trivial or obvious application of the older study. 

In the sequel terminology and notation not explicitly mentioned are uniform 
with the writer’s recent book,” to which the reader is referred also for an account 
of the theory of maximal convergence of sequences of polynomials in the complex 


variable. References not otherwise indicated are to that book. 
KL 


a for instance a recent report, Walsh, Bull. Amer. Math. Soc., vol. 35 (1929), pp. 
4, 


J * Interpolation and Approximation by Rational Functions in the Complex Domain (New 
York, 1935). The reader may refer to this book also for the history of approximation by 
Polynomials in z. 
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Throughout the present paper we shall deal with the z(= x + iy)-plane. A 
function u(z, y) is harmonic at a point if it is harmonic throughout a neighborhood 
of that point, and is harmonic in a region if there it is continuous together with its 
first and second partial derivatives, and satisfies Laplace’s equation. Two 
functions u(z, y) and v(z, y) are conjugate in a region if there they are continuous 
together with their first and second partial derivatives, and if the functions 
satisfy the Cauchy-Riemann differential equations: 


du ov ous 

dx y’ ays 
If u(x, y) and v(x, y) are conjugate in a region, they are harmonic there. If 
u(x, y) is given harmonic in a simply connected region, there exists a function 
v(x, y) conjugate to u(x, y) in that region; the function v(z, y) is uniquely 
determined except for an additive constant, which is arbitrary. Whenever 
u(x, y) and v(x, y) are conjugate in a region, so also are v(x, y) and —u(z, y). 
If u(x, y) is harmonic in a region, so are du/dx and du/dy; the functions du/az 
and —du/dy are conjugate there. 


2 Lemmas on harmonic functions. It will be convenient to have for reference 
several easily proved and well known lemmas concerning harmonic functions. 

LemMA I. Let S be a limited simply connected region, and let S; be an arbitrary 
closed subregion interior to S. There exists a number M’ depending only on S and 
S,, such that for every function U(x, y) harmonic and in absolute value not greater 


‘ than n in S we have 


6U 
Ox 


(1) 








] 
< M’n, aU) = Ms, (x, y) in Sy. 


Let Se. denote an arbitrary closed region bounded by an analytic Jordan 
curve, where Sz lies interior to S but contains S, in its interior; such a region 
S: exists, as we see at once by mapping S onto the interior of a circle. Let 
G(x, y; a, 8B) denote Green’s function for S: with pole in the point (z, y) and 
running codrdinates (a, 8). We have 


() Uy) = 2 | V@,A) dean, 
us ov 


where (z, y) is interior to S:, v is the interior normal, and the integral is taken 
over the boundary of S.. By differentiation of (2) with respect to x and y 
respectively, and by the inequality | U(z, y) | < 7 valid on the boundary of &:, 
we obtain inequalities (1).* 

Lemma II. Let S; be an arbitrary closed region bounded by an analytic Jordan 
curve, and let (a, b) be a fixed point interior to S,. Then there exists a number M" 





’ Of course Lemma I can be proved also by covering S; with circles interior to S and by 
using Poisson’s integral for the function U(z, y) in each circle. 
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depending only on S; and (a, b) such that for a function W(x, y) with continuous 
first and second partial derivatives in S, with the property 








aw (z, y) aw (zx, y) | my 
(3) oan =, eR =, (x, y) in Si, 
we have also 
(4) | W(z, y) ba W(a, b) | Ss M"'n, (z, y) in Si . 


We have the well known relation 


(x,y) aw aw ) 
(5) W(z,y) — Wa, b) = (2 dz + dy }, (x, y) in Si, 





where the integral is taken over a path having no point exterior to S, ; this 
integral is then independent of the particular path chosen. There exist paths 
of the kind described from (a, 6) to all points (z, y) of S,, such that the paths 
are in length less than some number independent of (z, y).‘ Inequality (4) 
follows at once from equation (5) and inequalities (3). 

Lemma III. Let S be a limited simply connected region, let S, be an arbitrary 
closed subregion interior to S bounded by an analytic Jordan curve, and let (a, b) 
be a fixed point interior to S;. Then there exists a number M'” depending only on 
8, S:, and (a, b) such that for every function U(x, y) harmonic and in absolute 
value not greater than n in S we have 


| Viz, y) — Va, b)| = Mn, (x, y) in Si, 


where V(x, y) is conjugate to U(x, y) interior to S. 
Lemma I is applicable and yields inequalities (1). By the Cauchy-Riemann 
equations 


uw vv wv wv 
dx ay’ ay dx’ 


we thus have inequalities on the first partial derivatives of V(x, y), and Lemma 
III follows from Lemma II. 

Iemma IV. Let S be a limited simply connected region, let S; be an arbitrary 
closed subregion interior to S bounded by an analytic Jordan curve, and let (a, b) 
be a fixed point interior to S,. Then there exists a number M“ depending only on 


TT 


‘It is obviously possible to choose such paths, for if we map the region S; onto the 
interior of the unit circle y :|2’| = 1 by a transformation 2’ = (z) so that (a, b) corre- 
sponds to the origin, we have 


dz’ 
'(z) 


atiy e* 
[ |dz| = i 
a+ib 0 


the latter integral may be taken over a line segment (of length not greater than unity), and 
the function 1/| #’(z) | is uniformly bounded on and within y. 
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S, S:, and (a, b) such that for every function U(x, y) harmonic interior to S and 
for which we have 






















au 


Ox =, (x, y) in S, 


(6) 








we have also 
b iv ° 
(7) U(x, y) — U(a, b) — ey -b) s M""n, (x, y) in S,; 


if for the function U(x, y) harmonic interior to S we assume 


aU 
ay 





S 7, (a, y) in S, 





instead of (6) then we have 


< M*"n, (az, y) in Sy. 





U(2, y) — Ula, b) — 27%) (@ — a) 





Let S2 denote a closed region interior to S, bounded by an analytic Jordan 
curve, and containing S; in its interior. The functions dU/dz and —@aU/dy are 
conjugate in S, so by Lemma III inequality (6) implies 
aU(x, y) _ aU(a, b) 

dy dy 
By inequality (6) and Lemma II we now have inequality (7). The proof is 
similar if we replace (6) by the corresponding inequality on aU/dy. 


s M'"'n, (x, y) in Se ° 











3. Degree of convergence, general sequences. The lemmas just established 
have immediate application to degree of convergence of sequences of harmonic 
functions. 

THEOREM 1. Let S be a limited simply connected region, and let S; be an arbi- 
trary closed subregion interior to S bounded by an analytic Jordan curve. Let the 
functions un(x, y) harmonic in S approach uniformly the function u(x, y), neces- 
sarily harmonic in S: 


lim m = 0, |u(z,y) — un(z,y)| Sn for (2, y) in S. 


no 


Then we have also 


du(x,y)  dun(z, y) | 
ra < M'm 
ax ax = Mm, 








du(z, y) al du,(z, y) < M'n (x y) in Si, 
ay By yer ateho i 


\f'(z) — falz)| < J/2M'n, zinS:, 


ou . OU Ou . OU 
where “2z) = — —i— i naete on seen 
(2) =o i ay’ fa(z) gt 


and where M’ is defined as in Lemma I. 
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Theorem 1 follows at once from Lemma I. The functions f’(z) and A (z) are 
of course analytic in S, respective derivatives of the analytic functions 


f(z) = ula, y) + w(z, y), fn(Z) = un(z, y) + iv,(z, y), 


where v(x, y) and v,(2, y) are conjugate to u(z, y) and u,(z, y) respectively in S. 
TaroreM 2. Under the hypothesis of Theorem | let (a, b) be a fixed point interior 
to S:;. Then we have 


| [v(x, y) — v(a, b)] — [vn(z, y) — vn(a, b)]| S M’’n,, (x, y) in Si, 
|[f(z) — iv(a, b)] — [fn(2) — tvn(a, bd) |S V1+ Mm, zinSs, 
where 
f(z) = ula, y) + we, y), — fn(2) = Un(z, y) + wvn(2, y), 


where v(x, y) and v,(x, y) are conjugate to u(x, y) and u,(x, y) respectively in S, 
and where M'" is defined as in Lemma III. 

If the sequence v,(x, y) converges to v(x, y) at even a single point of S,, then 
»,(a, b) converges to v(a, b), so v,(x, y) converges to v(x, y) uniformly in S,. If the 
sequence f,(z) converges to f(z) at even a single point of S, , then v,(a, b) converges to 
v(a, b), so fn(z) converges to f(z) uniformly in S, . 

Theorem 2 follows at once from Lemma III. 

TororEM 3. Let S be a limited simply connected region, and let S, be an 
arbitrary closed subregion interior to S bounded by an analytic Jordan curve and 
containing a fixed point (a,b). Let the functions u(x, y) and un(x, y) be harmonic 
in 8, and suppose we have 


ou OUn 


8 lim €, = 
(8) Im € 0, lle eS 


no 








<e, for (x, y) inS. 


Then we have also 





tute, y) + un(z, y)] — [u(a, b) — un(a, b)) — ES b) — Pua 4 (y — b) | 


s Men, (zx, y) in Si, 
av(a,b) _ av,(a, 4 on 

ox Ox 
< Men, (z, y) in Si, 





(oe, ») — ea(e, wl — fla, B) — ena, 89] ~ | 








(9) 








Ue) — B40 — Wa) — jad] +s] PE — BE Nie 0) | 5 VE Me 
zin Si ’ 
where f(2) = u(x, y) + iv(x, y), f.(z) = Un(2, y) + iv,(2, y), a=a+ ib, 


where v(x, y) and v,(x, y) are conjugate to u(x, y) and u,(z, y) respoctionty 4 in S, 
and where M“” is defined as in Lemma IV. 
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The first of inequalities (9) follows at once from LemmalIV. The inequality 
that occurs in (8) can also be written 


dv % OVn 
oy oy 


= €n; 








so the second of inequalities (9) also follows from Lemma IV. The first and 
second of inequalities (9) imply the third of those inequalities. 

Theorems 1, 2, and 3 concerning sequences of harmonic functions in general 
are now to be applied in particular to the study of sequences of harmonic 


polynomials. 
A polynomial in the complex variable z of degree n is any function which can be 
written in the form az" + az” + --- + a,. <A harmonic polynomial in x 


and y of degree n is the real part of a polynomial in z = x + ty of degree n. 
Thus if p,»(z, y) is a harmonic polynomial, with P,(z) = pr(x, y) + iqn(z, y), 
the function q,(x, y) conjugate to p,(x, y) (uniquely defined to within an additive 
constant) is also a harmonic polynomial of degree n, the real part of —7P,(z), a 
polynomial in z of degree n. Under these conditions the functions dp,/dz and 
ap,/dy are also harmonic polynomials, of degree n — 1, for we have 


! — ODn -OGn _ 9Pn _ . On 
ieidieals eee: ake elkose 


4. Degree of convergence, harmonic polynomials. The following convention 
is usual [op. cit., p. 65] and is to be retained throughout the sequel: 

Notation. Let C be an arbitrary closed limited point set of the z(= x + ty)- 
plane whose complement K with respect to the extended plane is connected, and 1s 
regular in the sense that K possesses a Green’s function G(x, y) with pole at infinity. 
The function w = g(z) = e°**", where H is conjugate to G in K, maps K con- 
formally but not necessarily uniformly onto the exterior of the circle | w,| = 1 so 
that the points at infinity in the two planes correspond to each other. Then Cr 
shall denote generically the locus | p(z) | = R > 1 (or G(x, y) = log R > 0) exterior 
to C; that is to say, Cz is the image of the circle | w | = R > 1 under the transforma- 
tion w = g(z). The case R = = is not excluded; the notation C,, refers to the 
point at infinity, and the “interior” of C., is considered to be the entire plane with 
the omission of the point at infinity. 

The fundamental result concerning degree of convergence and overcon- 
vergence of sequences of harmonic polynomials is” 

THEOREM 4. Let C be a closed limited point set composed of the mutually exterior 
closed Jordan regions J, , Je, +--+, dx. . 





5 Theorem 4 has already been established for the case k = 1; Walsh, Bull. Amer. Math. 
Soc., vol. 33 (1927), pp. 591-598. 
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If the function u(x, y) ts defined merely on C, and if the harmonic polynomials 
palx, y) of respective degrees n satisfy the relation 
(10) lim wn” =1/p <1, 4m = max[| u(z, y) — p,(z, y) |, (x, y) on CI, 
then the sequence p,(x, y) converges throughout the interior of C,, uniformly on 
any closed point set interior to C,. Consequently the function u(x, y) can be 
extended harmonically from C along paths interior to C, so as to be single-valued and 
harmonic throughout the interior of C, . 

If the function u(x, y) ts single-valued and harmonic on C, there exists a number p 
finite or infinite such that u(x, y) (or its harmonic extension) is single-valued and 
harmonic throughout the interior of C, but not throughout the interior of any C,- , 
o > p. There exists a sequence of harmonic polynomials p,(x, y) of respective 
degrees n such that (10) is valid, but there exists no sequence of harmonic poly- 
nomials p(x, y) of respective degrees n such that we have 
(11) lim yn” me 1/p’ < 1/p, Mn = Max [| u(a, y) = pra, y) F (x, y) on C). 

Let u(x, y) be now defined on C, let (10) be valid, and let R be arbitrary, 
1<R<p. Choose R; so that R < Ri <p. From (10) we have 


(12) | u(x, y) — pplz, y)| Ss —o 
1 


where M is suitably chosen. In the region J; (j = 1, 2, --- ,k) we choose a 
point (a; , b;), and define the functions 


(z,y) 
ou ou 
u(r, y) = ——dzr+—d 
iW) ie ( dy r Ox v) 


(z,y) 
OPn OPn ) 
nj(& = _ dx + d 
q if ’ y) . ( oy ° Ox y }» 


(x, y) on C, 








conjugate respectively to u(x, y) and p,(z, y) in J;. Of course the functions 
v(x, y) and q,;(x, y) depend effectively on j. 
Let us now set 


fi(z) = u(x, y) + w(x, y), P32) cS PrlX, y) + 1Qnj(X, y), 


‘o that f;(z) is analytic interior to J; , and P,,;(2) is a polynomial in z of degree n. 
We obtain at once from (12) by Theorem 1 


(18) If@) —Pi@| st, zonJj, 
Ri 
Where J; is an arbitrary closed region interior to J; bounded by an analytic 
Jordan curve, and where M; is suitably chosen. 
If the region J ; is suitably chosen, the closed region J; lies [op. cit., §2.1, 
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Theorem 2] interior to [J : . Forzon{[J jlevie and consequently for z on J j 
we have [op. cit., §4.7, Corollary to Theorem 8] from (13) 


(14) | fi(z) = P,,;(z) | = = zon Ji, 


where Mz is suitably chosen;° and Mz need not depend on j. 

We come now to the reason for the detailed study of the derivatives f(z) 
and P’,;(z) rather than the functions f;(z) and P,;(z) themselves; it is precisely 
in this respect that the proof of Theorem 4 for k > 1 differs essentially from the 
proof for k = 1: the derivatives f;(z) and P’,;(z) are independent of j. Indeed, we 
may write 

. OV; ou poi ou 


/ ou 
L{@ = = + 1 +e “oy? 
(15) 
=z OP» _ ; OPe 


ax ax ay” 





Pajle) = ee 


The function P;,;(z) is a polynomial in z of degree n — 1, so inequality (14) valid 
on C for every R < p implies [op. cit., §4.6, Theorem 6] the convergence of the 
sequence P,;(z) to a function which may be denoted by f;(z), throughout the 
interior of C, , uniformly on any closed set interior to C, . 

It is appropriate to mention the independence of 7 not merely of P’,;(z) but 
also of f;(z), for several points (a; , b;) may lie interior to a single Jordan region 
bounded by C, or a part of C,. We need to know that in such a region fi(2) 
does not depend on 7. 

On any closed set J interior to C, containing a point (a; , b;) we have from 
Theorem 3 


(16) lim p,(z, y) = u(z, y), uniformly in J, 

for p,(a;, b;) approaches u(a;, b,;), and by (14) the derivative ap,(a; , b;)/dy 
approaches du(a;, b;)/dy. Each of the finite regions into which C, separates 
the plane contains a point (a;, b;). The sequence pn{z, y) thus converges to a 
function which may be denoted by u(z, y), throughout the interior of C,, 
uniformly on any closed set interior to C,. Hence the function u(z, y) can be 
extended harmonically from C so as to be single-valued and harmonic throughout 
the interior of C,. The first part of Theorem 4 is established. 





6 This reasoning is used many times in the book mentioned, for instance pp. 96-97. We 
may also prove and apply the following general theorem: 

Let C be a closed limited Jordan region; suppose u < 1 is given, and we have on every closed 
subset C’ interior to C 


lim {max [| f(z) — pa(z) |, zon C’)}}"" < pw, 


where py(z) is a polynomial in z of degreen. Then we have likewise 


lim {max [| f(z) — pa(z) |, zon C]}"™ < wp. 
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Suppose now u(z, y) is given single-valued and harmonic on C. The number p 
of Theorem 4 exists [compare the method of op. cit., p. 58]. Let v(z, y) be a 
function conjugate to u(x, y) interior to C, ; if the interior of C, consists of 
several regions there need be no relation between the values of u(x, y) or between 
the values of v(x, y) in these various regions. The function f(z) = u(z, y) + 
in(z, y) is single-valued and analytic throughout the interior of C,. Con- 
sequently [op. cit., p. 79] there exist polynomials P,,(z) in z of respective degrees n 
such that we have for an arbitrary R < p 


Ife) — Pale)| so, zone, 
where M is suitably chosen. Therefore we have also 
M 
| u(z, y) Be oy prlx, y) | s R”’ (x, y) on C, 


where P,(z) = pa(x, y) + i@n(z, y). This inequality yields 


(17) lim PE ob ~* 1/p, Mn = Max [| u(x, y) aa paz, y) l, (a, y) on C}. 

Either the inequality in (17) or equation (11) would imply (by the part of 
Theorem 4 already established) that u(z, y) is single-valued and harmonic 
throughout the interior of some C,-, p’ > p, contrary to the definition of p. 
Then equation (10) and the impossibility of equation (11) are established, so the 
proof of Theorem 4 is complete. 

A proposition less specific but somewhat simpler than Theorem 4 is: Let C 
be a closed limited point set consisting of the mutually exterior closed Jordan regions 
Ji,Jo,+++,Jy. Let the function u(z, y) be defined and single-valued on C. A 
necessary and sufficient condition that u(x, y) be harmonic on C is that there exist 
harmonic polynomials p,(x, y) of respective degrees n such that equation (10) is 
fulfilled, where p is some number greater than unity. 


5. Degree of convergence on C,. A more explicit result than equation (16) 
can be obtained by inspection of the proof of (16). Let us choose an arbitrary 
C,,0 < p, having no multiple point. Let J be the closed interior of an arbitrary 
one of the analytic Jordan curves composing C, ; then J contains in its interior 
at least one of the points (a;, b;). Inequality (14) implies [op. cit., p. 81, 
Corollary to Theorem 8] 
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z on or within C,, o <R, 


Where M; is suitably chosen. Consequently we may write for z in J 


Myo" M0” 
< : 
— RP’ < 


a(u ae Dn) 
Ox 














a(u ny Dn) 
dy 




















Si 


330 J. L. WALSH 


As a particular case under (12) we have 





| u(a;,b;) — pa(a;, b;)| S = Ri >R, 


n? 
1 


so by Lemma II we write the inequality 


Mic" 
Re...” 





(18) |u(z, y) — prlz,y)| S 


valid in each of the closed regions J, thus (with the requisite modification in 
M, if necessary) valid throughout the closed interior of C, . 

Inequality (18) has for convenience been established only when C, has no 
multiple points. If now C, has multiple points we may choose R’, R < R’ < p, 
and may choose o’, ¢ < o’ < ck’/R, in such a manner that C,- has no multiple 
points. By the proof of (18) we have 


(19) ’eei~adeia u.(Z) < M, (7) 


throughout the closed interior of C,, , hence throughout the closed interior of (,. 
The arbitrariness of R < p in (18) and (19) allows us now to formulate’ 

TueoreM 5. Let C be a closed limited point set consisting of a finite number of 
mutually exterior closed Jordan regions. Let the function u(x, y) be defined on C, 
and let the harmonic polynomials p,(x, y) of respective degrees n satisfy (10). 
Then we have also (1 < o < p) 


(20) lim [un(o)}"" So/p, uno) = max|| u(x, y) — pal, y) |, (x, y) on Cl). 


If under the conditions of Theorem 5 the function wu(z, y) is single-valued and 
harmonic throughout the interior of C, but not throughout the interior of any 
C,, p’ > p, then the inequality in (20) is impossible. This follows from the 
relation [C.],;. = C, and from the last part of Theorem 4. That is to say, we 
have proved the 

Corotiary. Under the hypothesis of Theorem 5 let u(x, y) be single-valued and 
harmonic throughout the interior of C, but not throughout the interior of any C,, 
p’>p. Then we have 


lim [un(o)]""" = o/p. 





7 Both Theorem 5 and the first part of Theorem 4 are easily proved also from the fol- 
lowing: 

Lemma. Let C satisfy the hypothesis of Theorem4. Let there be given R and Ri,1< Ri < 
R. There exists a number N depending only on C, R, and R,, such that for every harmonic 
polynomial rn(x, y) of degree n of modulus not greater than n on C we have 


| mn(z, y) | S Nak”, (x, y) on or within Cay. 


This lemma may be proved by Lemma I, by double application to the function Omn/ az 4 
10n,/dy of the analogue [op. cit., p. 77] of the present lemma first to the point sets J; used in 
the proof of (14) and second to the set C, and finally by Lemma II. 
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It is convenient to formulate for reference several other results whose proofs 
are essentially contained in the proof of Theorems 4 and 5: 

TuroreM 6. Let C be a closed limited ;point set consisting of the mutually 
exterior closed Jordan regions Ji, J2,---,J,. Let the function u(x, y) be 
defined on C, and let the harmonic polynomials p,(x, y) of respective degrees n 
satisfy the relation 
(21) iim[v.(J;)I" S1/o <1, »n(Jj;) = max[| u(x, y) — p,(z, y)|, (x, y) in Jj), 
for every closed Jordan region J ; interior to J; bounded by an analytic Jordan 
curve. Then we have also 
(22) lim yn” - 1/p, Mn = Max [| u(z, y) ii pr(, y) l, (x, y) on Ci. 

Inequality (21) instead of equation (10) is by Theorem 1 sufficient for the 
proof of (13), and hence is sufficient for the conclusion of Theorem 5. But we 
may obviously write 


Hn S ma(o), 
whence from (20) 
lim un” < o/p. 

This last inequality holds for every ¢, 1 < o < p, and hence yields (22). 

TororEM7. Let C be a closed limited point set consisting of the mutually exterior 
closed Jordan regions J1, Jo, --- , J. Let the function u(x, y) be defined on C, 
and let the harmonic polynomials p,(x, y) of respective degrees n satisfy the relation 
(22). Then we have also 


(23) lim er s 1/p, An = max [| f’(2) a P,(z) I, zon Cl, 
where f(z) = du/ax — idu/dy, P..(2) = ap,/dx — idpn/dy. 


In the case k = 1 we have 


(24) lim x" S$ 1/p, Xe = max|| f(z) — Pa(2)|, 2 on C} 
where f@) = ut w, Pra(z) = pa + ign , and where v(x, y) and qa(zx, y) are con- 
Jugate respectively in C to u(x, y) and p,(x, y), and vanish in a fixed point (a, b) 
interior to C. 

Inequality (23) follows at once from Theorem 1 and Theorem 6. 

Inequality (24) follows from Theorem 2 and Theorem 6. 


6. Maximal convergence. Theorem 4 suggests of itself the formulation of the 
Derinition. Let C be a closed limited point set consisting of a finite number of 
mutually exterior closed Jordan regions J;,J2,---,J%. Let the function u(z, y) 
(or its harmonic extension) be single-valued and harmonic throughout the interior of 
C, but not throughout the interior of any C,, p’ > p. If a sequence of harmonic 
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polynomials p,(x, y) of respective degrees n satisfies (10), then the sequence p,(z, y) 
is said to converge to u(x, y) on C maximally, or with the greatest geometric degree of 
convergence. 

Whenever C and u(z, y) are given, it follows from Theorem 4 that there exists q 
sequence p,(x, y) converging maximally to u(z, y) on C. The property of maxi- 
mal convergence is shared by many sequences of polynomials with extremal 
properties (see §12 below), so we study in some detail the implications of maximal 
convergence. 

TuroreM 8. Let C be a closed limited point set consisting of a finite number of 
mutually exterior closed Jordan regions. Let the function u(x, y) be single-valued 
and harmonic throughout the interior of C, but not throughout the interior of any 
C,,p’ > p. Let the sequence of harmonic polynomials p,(x, y) of respective degrees 
n be given. A necessary and sufficient condition that the sequence p,(x, y) converge 
maximally to u(x, y) on C is that the sequence p,(x, y) converge maximally to u(z, y) 
on the closed interior of every C,,0 < p. 

Let the sequence p,(x, y) converge maximally to u(x, y) on C. By the relation 
[Celoje = C,, maximal convergence of the sequence p,(x, y) to u(x, y) on the 
closed interior of C, follows from the Corollary to Theorem 5. To be sure, maxi- 
mal convergence on the closed interior of C, is as yet strictly defined only in the 
case that C, has no multiple points. But definition and proof extend readily 
(compare §10 below) to the general case. 

Conversely, let us suppose for every ¢ < p 


lim [un(o)]” = a/p, Un(o) = max [| u(z, y) cr p(x, y) l, (x, y) on C.). 


We obviously have yn S un(o), whence for every « < p 


lim uy” < o/p. 
Here we allow o to approach unity and notice that by Theorem 4 the left-hand 
member cannot be less than 1/p; we obtain (10). 

Regions of uniform convergence of sequences converging maximally can be 
determined with a fair degree of completeness: 

TuroreM 9. Let C, u(x, y), and p,(x, y) satisfy the conditions of the italicized 
definition. Then on and within every C,(o < p) the sequence pn(x, y) converges 
uniformly to u(x, y); but the sequence p,(x, y) converges uniformly in no region 
containing in its interior a point of C, . 

The first part of Theorem 9 is included in Theorem 8; we proceed to study the 
second part. Suppose the sequence p,(z, y) converges uniformly in some simply- 
connected region D containing in its interior a point (x1. yi) of C,; we shall 
reach a contradiction. The region D overlaps C,, throughout the interior of 
which p,(z, y) converges to u(z, y), so the limit of p,(z, y) may be denoted by 
u(x, y) in D even exterior to C,. 
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If we define as usual 


° , OPn ° ODn 
—— a —- Pp, ss a 


it follows from Theorem 1 that the equation 


lim Pi,(z) = f (2) 


holds uniformly in some neighborhood of (2; , y:1). It then follows from Theorem 
7 and from a general result on sequences of polynomials in the complex variable,* 
that this equation holds uniformly on any closed set interior to some C,, , p’ > p. 
It follows from Theorem 3 as in the proof of (16) that the sequence p,(z, y) 
converges throughout the interior of C,-, uniformly on any closed set interior to 
C,, and hence that u(x, y) can be harmonically extended from C so as to be 
single-valued and harmonic throughout the interior of C,,, contrary to hypoth- 
esis. This contradiction completes the proof of Theorem 9. 

By inspection of the proof just given we can formulate the slightly more 
general 

Corottary. Let C be a closed limited point set consisting of a finite number of 
mutually exterior closed Jordan regions. Let the function u(x, y) be defined on C, 
and let the harmonic polynomials p,(x, y) of respective degrees n satisfy (10). 
If the sequence p,(x, y) converges uniformly in a region having in its interior a 
point of C, , then for a suttable value of ¢ < p, and for some closed region D contain- 
ing a point of C, in its interior, we have 

lim {max[] u(x, y) — pa(z, y) |, (2, y) in D]}" S o/p; 

the sequence pn(x, y) converges throughout the interior of some C,, p' > p, uni- 
formly on any closed set interior toC,. Thus u(z, y) is single-valued and harmonic 
throughout the interior of C, . 

Theorem 9 asserts the impossibility of uniform convergence of the sequence 





* Let C be an arbitrary closed limited point set whose complement is connected and regular. 
Let &(z) be defined on C and let the polynomials xq(z) of respective degrees n satisfy the condition 


lim 24” = 1/p <1, An = max [| (z) — ,(z)|, z on C). 
n- 


If the sequence w»(z) converges uniformly in some closed region containing a point of C, in its 


interior, then for a suitable value of ¢ < p, and for some closed region D containing a point of 
C, in its interior, we have 


lim {max [| ®(z) — xa(z) |, 2 in D}}"" s o/p; 


the sequence rn(z) converges throughout the interior of some Cy, p’ > p, uniformly on any 
closed set interior to Cpr. 


All the material for the proof of this theorem is given [op. cit., §4.8], but a corollary of 
the theorem is formulated rather than the theorem itself. 
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Px(z, y) in any region containing in its interior a point of C,. Nevertheless the 
sequence p,(z, y) may converge uniformly on a Jordan are which is partly in- 
terior and partly exterior to C,. For instance let us use polar codrdinates 
(r, 6), and let us set 


palt,y) = gar sin (20), ule, y) = lim p(x, y). 

We may choose C as the unit circle: r S 1, so Cz is the curve r = R. We 
obviously have p = 3. But in the point r = 3, @ = 1/3, the sequence p,(z, y) - 
Pn—1(2, y) fails to approach zero with 1/n, the sequence p,(zx, y) cannot converge, 
hence the point r = 3, 6 = 7/3 cannot lie interior to C,. This establishes the 
inequality p < 3 and the equation p = 3. Nevertheless we see by inspection 
that the sequence p,(xz, y) converges uniformly on every ray @ = mz/2',0 < 
r <*oo, where m and q are integers. 

A result somewhat similar to the Corollary to Theorem 9 is 

THEOREM 10. Let C be a closed limited point set consisting of mutually exterior 
closed Jordan regions J;, J2,+--, J. Let the function u(x, y) be defined on C, 
and let the harmonic polynomials p,(x, y) of respective degrees n satisfy (10). 
If for a single value of j we have 


(25) Tim {max |lu(x, y) — pa(x, y) |, (2, y) on Jil} S 1/p"” < 1/p, 


then the sequence p,(x, y) converges throughout the interior of some C,., p’ > p, 
uniformly on any closed set interior to Cy. Thus u(x, y) is single-valued and 
harmonic throughout the interior of C,- . 

In particular inequality (25) cannot hold if the sequence p,(x, y) converges 
maximally to u(x, y) on C, where p has the usual significance. 

By Theorem 5 we have from inequality (25) for at least one j 


lim {max [| u(z,y) — pa(z, y) |, (x,y) on [Jilprzol}””” S 1/p. 


ne 


Let C’ denote the point set composed of C plus the closed interior of [J jl»: 
If p’”’ is suitably chosen, the complement of C’ is connected and regular. From 
(10) and the inequality just proved we may write 


lim {max || u(z, y) — pax, y) |, (@,y) on C’}}"" < 1/p. 


It follows from Theorem 4 that the sequence p,(z, y) converges to u(?, y) 
interior to C;, uniformly on any closed set interior to C, ; hence u(z, y) is single- 
valued and harmonic throughout the interior of C;. But the locus C’, contains 
in its interior C, and hence also some C,,, so the proof is complete. 

The proof of Theorem 10 requires only minor changes if (25) is replaced by 


(26) lim {max [| u(x, y) — pr(x, y) |, (x, y) in D]}"” S 1/p” < 1/p, 
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where D is a closed Jordan region to which belongs either a point exterior to C 
or a point on the boundary of C; the conclusion of Theorem 10 remains valid. 

Consequently, if the sequence p,(z, y) converges maximally to u(z, y) on C, 
where p has the usual significance, inequality (26) is impossible for such a region 
D. Under these conditions on u(z, y), px(x, y), C, and p, and if D is an arbitrary 
closed Jordan region which is contained in C but to which belongs a boundary 
point of D, then the left-hand member of (26) is equal to 1/p; if D is an arbitrary 
closed Jordan region whose interior lies interior to some C,(¢ < p) but one of 
whose boundary points lies on C, , then the left-hand member of (26) is equal 
to a/p. 

This more general proposition has immediate application, which we shall not 
mention in further detail, to many of the sequences considered in §§7-9 below.” 


7. Differentiation of maximal sequences. It is natural to inquire whether 
the operations of differentiation and integration alter the property of maximal 
convergence. We shall now discuss these questions, always with the notation 
and hypothesis of the italicized definition of maximal convergence. 

THEorREM 11. Let the sequence p,(x, y) converge maximally to u(x, y) on C. 
Then we have 


lim “1,” = 1/p, sim = max[|du/axr — dap,/ar|, (z, y) on Cl, 


no 


(27) 
max [| du/dy — dpn/dy|, (x,y) on C]. 


ee 
lim Hon” — 1/p, Hen 


no 


In particular if du/dx [or du/dy] is single-valued and harmonic throughout the 
interior of no C, , p’ > p, and therefore always in the case k = 1, then the sequence 
dp,/dx [or dp,/ dy] converges maximally to du/adx [or du/dy] on C. 





* The analogue of the more general result can be proved in an analogous manner: 
Let C be a closed limited point set whose complement is connected and regular. Let f(z) be 
defined on C, and let polynomials pp(z) in z of respective degrees n satisfy the relations 


lim {max [| f(z) — pa(z) |, zon C]}"™ = 1/p <1, 


(26") lim {max [| f(z) — pa(z)|, zon D]}/™ = 1/p” < 1/p, 


where Dis a closed set whose complement is connected and regular, and where D contains either a 
point exterior to C or a point on the boundary of C. Then the sequence py(z) converges through- 
out the interior of some Cyr, p! > p, uniformly on any closed set interior to Cpr. Thus f(z) is 
single-valued and analytic throughout the interior of Co. 

. Consequently, if the sequence py(z) converges maximally to f(z) on C, where p has the usual 
significance lop. cit., §4.7] inequality (26’) is impossible for such a set D. Under these con- 
ditions on f(z), pa(z), C, and p, and if D (closed, with complement connected and regular) 
belongs to C but contains a point of the boundary of C, then the left-hand member of (26') is 
equal 10 1/p; if every point of D (closed, with complement connected and regular) lies on or 
nlerior to some Cg (a < p), but if at least one boundary point of D lies on Co, then the left-hand 
member of (26’) is equal to ¢/p. 
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From Theorem 7 we have 


(28) jim ut.” < 1/p, lim 2,” S 1/p. 


Neither strong inequality can hold here. Suppose for instance we have 


(29) jim min” = 1/p’ < 1/p, iim 2.” S 1/p. 
The function dp,/dz is a harmonic polynomial of degree n — 1, or if we prefer of 
degree n. It follows from Theorem 4 that we have 


lim 22x ¥) _ aula, y) 
no Ox Ox 





throughout the interior of C,, uniformly on any closed set interior to C,. 
Each of the finite regions into which C, separates the plane contains in its 
interior at least one of the closed Jordan regions J ; composing C, and interior to 
each region J; we choose a point (a;, b;). By the maximal convergence of 
pr(x, y) to u(x, y) on C we have 


lim Dnla;, b;) = u(a;, b;). 


By the second of relations (29) we have 


lim Opn(a;, bj) _ du(a;, b) 
ne oy oy 


From Theorem 3 it follows that p,(z, y) approaches u(x, y) throughout the 
interior of C,-, uniformly on any closed set interior to C,, and hence that 
u(x, y) is single-valued and harmonic throughout the interior of C,’, p’ > p. 
This contradicts the definition of p, and thereby proves the impossibility of (29). 
In a precisely similar way it can be shown that the simultaneous relations 

lim ui,” S 1/p, lim ua” = 1/p’ < 1/p 


nwo no 





are impossible, so equations (27) are established. 

If du/dx [or du/dy] is harmonic at a point, so also is du/dy [or du/dz], hence 
so also is u(x, y). Thus in the case k = 1 the function u(z, y) has a singularity 
on C,, and both the functions au/ax and au/dy have singularities on C,. 
Theorem 11 is completely proved. 

It is to be noticed that under the hypothesis of Theorem 11 with k > 1 we 
cannot assert the maximal convergence on C of the sequences @p,/dx and 
dp,/dy to the respective functions du/dx and du/dy. For instance let us choose 
k = 2, u(z, y) equal to 1 and 2in J, and Jz respectively. Then we have du/dt = 
0, du/dy = 0, p < ~, so by (27) the sequences ap,/dx and dap,/dy do not con- 
verge maximally on C. 

It may also occur in Theorem 11 that du/dz is single-valued and harmonic 
throughout the interior of no C,,, p’ > p, whereas du/dy does not possess that 
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property. This situation arises for instance if C, separates the plane into 
precisely two finite regions, with du/dzx zero and unity in these respective regions, 
and du/dy zero in both regions. 

Theorem 11 establishes more specific results than Theorems 1 and 7, under a 
more restricted hypothesis. 

Under the general conditions of Theorem 11, the sequence dp,/dz [or ap,/ay] 
cannot converge uniformly in a region containing in its interior a point of C, . 
For if this were to occur, the sequences dp,/dy [or dp,/dx] and p,(zx, y)} would 
also converge uniformly in such a region, by Theorems 2 and 3 respectively, in 
contradiction to Theorem 9. Thus we have proved the 

CoroLtLtary. Under the hypothesis of Theorem 11, neither of the sequences 
dp,/dx, dp,/dy can converge uniformly in a region containing in its interior a 
point of C, . 

The method used in the proof of Theorem 11 and its Corollary applies by 
iteration to the study of the higher partial derivatives of p,(x, y) and u(z, y). 


In each case we have 
I/n 
, (x, y) on o}} = 


(30) lim {max| 
The sequence d°**p,,/dx'ay’ can converge uniformly in no region containing in 
itsinterior a point of C,. If a°*’w/axr‘dy’ is single-valued and harmonic through- 
out the interior of no C,,, p’ > p (this always occurs if k = 1), equation (30) 
indicates maximal convergence on C of the sequence d°*’p,/az‘dy’ to the function 
a u/ax‘ay’. 

It is worth mentioning in connection with Theorem 11 that not merely equa- 
tions (27) are valid but also the equations 


__ inks 1/n 
lim {max| = a oe , (z, y) on 1]} = 1/p, 


ait uU a*t4 p 
axt ay? Oat ay? 











. 
= 








ox Ox 
(31) 1/ 
_ ou 0 n 7 
Ti, {max |2¢ - < , (x, y) on Ji} = 1/p, 





for every j. If equations (31) do not hold, at least one of those equations (say 
the former) is to be replaced by the corresponding inequality (the left-hand 
member is less than the right-hand member) for some value of 7. From Theorem 
10 follows the convergence of the sequence dp,,/dx to the function du/dz through- 
out the interior of some C, , p’ > p, uniformly on any closed set interior to C,: . 
This is impossible, by the method used in the proof of Theorem 11. 

Equations similar to (31) hold also for the higher partial derivatives of 
u(z, y) and Px»(z, y), and may be similarly established.” 


ee ye) 


* A similar proof yields the analogue: 
Let C be a closed limited point set whose complement is connected and regular. Let Ci 
be a closed subset of C which contains at least one point of the boundary of C and whose comple- 
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From Theorem 7 we have 


(28) lim win” S 1/p, lim urn” S 1/p. 


no 


Neither strong inequality can hold here. Suppose for instance we have 


(29) lim ui,” = 1/p’ < 1/p, lim y2..” < 1/p. 
The function dp,/dz is a harmonic polynomial of degree n — 1, or if we prefer of 
degree n. It follows from Theorem 4 that we have 





Kim 22n@y) _ ula, y) 
no Ox Ox 


throughout the interior of C,,, uniformly on any closed set interior to (,.. 
Each of the finite regions into which C,, separates the plane contains in its 
interior at least one of the closed Jordan regions J ; composing C, and interior to 
each region J; we choose a point (a;, b;). By the maximal convergence of 
Pr(x, y) to u(x, y) on C we have 


lim Dnla;, b;) = u(a;, b;). 


By the second of relations (29) we have 


lim dpn(a;, b;) os du(a;, bi) 
no oy oy 


From Theorem 3 it follows that p,(z, y) approaches u(x, y) throughout the 
interior of C,,, uniformly on any closed set interior to C,, and hence that 
u(x, y) is single-valued and harmonic throughout the interior of C,, p’ > p. 
This contradicts the definition of p, and thereby proves the impossibility of (29). 
In a precisely similar way it can be shown that the simultaneous relations 

lim “in” S 1/p, lim pan” = 1/p’ < 1/p 


no 





are impossible, so equations (27) are established. 

If du/dzx [or du/dy] is harmonic at a point, so also is du/dy [or du/dz], hence 
so also is u(x, y). Thus in the case k = 1 the function u(z, y) has a singularity 
on C,, and both the functions du/az and au/ay have singularities on ¢,. 
Theorem 11 is completely proved. 

It is to be noticed that under the hypothesis of Theorem 11 with k > 1 we 
cannot assert the maximal convergence on C of the sequences dp,/d« and 
dp,/dy to the respective functions du/dx and au/ay. For instance let us choose 
k = 2, u(x, y) equal to 1 and 2in J; and Jz respectively. Then we have du/dt = 
0, du/dy = 0, p < ~, so by (27) the sequences dap,/dax and ap,/dy do not con- 
verge maximally on C. ; 

It may also occur in Theorem 11 that du/dz is single-valued and harmonlc 
throughout the interior of no C,-, p’ > p, whereas du/dy does not possess that 
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property. This situation arises for instance if C, separates the plane into 
precisely two finite regions, with du/dz zero and unity in these respective regions, 
and du/dy zero in both regions. 

Theorem 11 establishes more specific results than Theorems 1 and 7, under a 
more restricted hypothesis. 

Under the general conditions of Theorem 11, the sequence dp,/dzx [or ap,/dy] 
cannot converge uniformly in a region containing in its interior a point of C, . 
For if this were to occur, the sequences dp,/dy [or ap,/dax] and p,(z, y) would 
also converge uniformly in such a region, by Theorems 2 and 3 respectively, in 
contradiction to Theorem 9. Thus we have proved the 

Corottary. Under the hypothesis of Theorem 11, neither of the sequences 
dp,/dx, Opn/dy can converge uniformly in a region containing in its interior a 
point of C, . 

The method used in the proof of Theorem 11 and its Corollary applies by 
iteration to the study of the higher partial derivatives of p,(z, y) and u(z, y). 


In each case we have 
1/n 1 
, (x, y) on o}} =-, 
p 
i+j 


(30) lim {max | 
The sequence d°*’p,/ax‘dy’ can converge uniformly in no region containing in 
itsinterior a point of C,. If a°**u/ax‘ay’ is single-valued and harmonic through- 
out the interior of no C,, p’ > p (this always occurs if k = 1), equation (30) 
indicates maximal convergence on C of the sequence d°*’p,/az‘dy’ to the function 
a u/ac‘ay’. 

It is worth mentioning in connection with Theorem 11 that not merely equa- 
tions (27) are valid but also the equations 


gt u at? p 
aziay’ ax ay? 

















1/n 
no Ox Ox 
(31) ; 1/ 
ca Gu _ Pr : ee 
lim {max | ay ay ’ (x, y) on Al = 1/p, 








for every j. If equations (31) do not hold, at least one of those equations (say 
the former) is to be replaced by the corresponding inequality (the left-hand 
member is less than the right-hand member) for some value of 7. From Theorem 
10 follows the convergence of the sequence dp,/dx to the function du/dz through- 
out the interior of some C,, , p’ > p, uniformly on any closed set interior to C,, . 
This is impossible, by the method used in the proof of Theorem 11. 

Equations similar to (31) hold also for the higher partial derivatives of 
u(z, y) and p,(z, y), and may be similarly established.” 


aaa cn g 


“A similar proof yields the analogue: 
a C be a closed limited point set whose complement is connected and regular. Let C, 
a closed subset of C which contains at least one point of the boundary of C and whose comple- 























338 J. L. WALSH 


8. Integration of maximal sequences. Let p,(x, y) be a harmonic polynomial 
of degree n, and let ¢,(z, y) be a conjugate function: 





(z,y) 
OPn OPn ) 
n = qdn\a, Ree cal d. < 
(32) daz, y) = anl0,0) + J” (—2PP ae 4 2 ay 
Then the functions 
(zy) 
P,(x, y) = P,(a,b) + / c (pr(z, y)dx — n(x, y) dy), 


(33) (x,y) 
Q,(z, y) = Q,(a, b) + i AY (qn(z, y) dx + pn(zx, y) dy), 


are harmonic polynomials of degree n + 1, for the function 
zt+iy 
Pale, u) + iQaCe, ») = Pala, ) + Qu(0,6) +f” ” (oa + iad 
is a polynomial in z of degreen + 1. The following theorem suggests itself: 
THEoREM 12. Let the sequence of harmonic polynomials p,(x, y) converge 
maximally to the function u(x, y) in the closed Jordan region C. Let (a, b) bea 
fixed point interior toC. Let us introduce the notation (32) and (33) together with 


. ? (tae 4. May) 
v(x, y) = v(a, b) + be (2 dx + an 4Y q 
(z,y) 
U(z, y) = U(a, b) + [ (u(x, y)dx — v(z, y) dy), 


(34) (z,y) 
V(z, y) = Via, b) + i . (v(x, y) dx + u(z, y) dy), 


where the integrals are to be taken over paths interior to C, (usual notation). Let us 
assume 


lim | (a,b) — gn(a, b) |" < 1/p. 


no 


If we have 
lim | U(a, b) — P,,(a, b) |" < 1/p, 


nea 





ment is connected and regular. Let the sequence of polynomials pp(z) of respective degrees 
converge maximally to f(z) onC. Then we have also 


lim {max [| f(z) — ppa®(z) |, zon Ci} = 1/p, 


where p has the usual significance. The conclusion continues to hold if C, is replaced by C. 
The sequence p?(z) converges uniformly in no region containing in its interior a point 0 
Cy. If f(z) is single-valued and analytic throughout the interior of no Cp’, p’ > Py then the 
sequence p{? (z) converges maximally to f” (z) on C. 
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then the sequence P,(x, y) converges maximally to U(x, y) onC. If we have 


lim | V(a,b) — Q,(a, b) ["" < 1/p, 
then the sequence Q,(x, y) converges maximally to V(x, y) on C. 

Since u(x, y) is harmonic throughout the interior of C, but not throughout the 
interior of any C,- , p’ > p, the same is true of v(z, y) and of U(z, y) and V(z, y). 
Theorem 12 is then a consequence of Theorems 2, 3, and 6.” 

Theorem 12 does not extend at once to the case of a point set C which consists 
of several mutually exterior closed Jordan regions J;, J2,---,J;. To be 
sure, under these conditions a point (a; , b;) can be chosen interior to each J; , 
and functions v,(z, y); Qni(Z, y), U (2, y), V x(z, y), P,.,(z, y), Q, (x, y) can be 
defined by equations analogous to those already used. But these functions 
P,,(z, y) and Q,;(x, y) may depend effectively on j; it may be impossible to 
define such functions as polynomials independent of j. 

We illustrate the fact just mentioned by an example already used elsewhere 
(op. cit., pp. 82-83]. Let C be the point set | z’ — 1| < 1/4, and let u(z, y) 
in the left-hand oval of the curve | 27 — 1 | = 1/2 be identically zero and in the 
right-hand oval be 3 log [(a — 34/2)’ + y’]. The function u(z, y) is harmonic 
throughout the interior of C, but not throughout the interior of any C,, , p’ > p, 
where p = 2”. Let the polynomials p,(z, y) be found by taking the real part 
of the expansion of the function f(z) = u(x, y) + iv(x, y) (where v(z, y) is con- 
jugate to u(x, y) on C) in a Jacobi series: 
fle) = ao + ax(z + 1) + an(e — 1) + aa(e — 1)@ +1) + +++, lim Ja, |" = 2". 


We may choose (a; , b:1) as (—1, 0) and (a2, be) as (1,0). We then have by (33) 


1 


P,(1, 0) — P,(—1, 0) = [ “pale, vdde, 


and this sequence diverges [op. cit., p. 83]. That is to say, it is impossible to 
find a harmonic polynomial P,(2, y) represented by (33) such that the two 
sequences P,,(1, 0) and P,,(—1, 0) simultaneously converge. 

It may naturally occur in particular instances when C consists of mutually 





" Suppose we have An 20, B, =0, 


lim AY™ <1/p, lim B/" S$ 1/p, pp > 1. 
neo 


IfR < pis arbitrary, we then have for sufficiently large n 
A, 3 1/R’, B, = 1/R", A, + Bn S 2/R", 
lim [An + B,]/" < 1/R, 


neo 


whence by the arbitrary character of R < p, 


lim [An + B,]'!* = 1/p. 


no 
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exterior closed Jordan regions Ji, Jz, --- , Jz that functions »;(z, y), U;(2, y), 
V x(x, y), and polynomials g,;(x, y), Pri(z, y), Qni(z, y) can be defined in J ; 80 
as to be independent of j. Under such circumstances Theorem 12 can be 
directly extended. In any case there can be established the 

Corouuary. Let the sequence of harmonic polynomials p,(x, y) converge 
maximally to the function u(x, y) on the point set C consisting of the mutually 
exterior closed Jordan regions Ji, J2, +++ ,J%. Let the point (a; , b;) lie interior 
to Jj . Let the functions v(x, y); Qni(2, y); U,(2, Y); V ,(2, y), P,;(z, y), Q, (2, y) 
be defined by equations analogous to those used in Theorem 12, where we have for 


every J 
lim | v;(a;, bj) — qni(aj, b;) |"" < 1/p. 


If for a particular j we have 
lim | U;(a;, bj) — Paj(a;, b;) |"" S 1/p, 


ne 


then we have for that j 
(35) lim {max | U;(x, y) — Pas(2, y) |, (, y) on Js}""" = 1/p. 


If for a particular j we have 
lim | V;(a;, b;) 4 Qnj(a;, b;) Pe Ss 1/p, 


then we have for that j 
(36) lim {max | V(x, y) — Quiz, y) |", (w, y) on Jy}"" = 1/p. 


Neither the sequence P,,;(x, y) nor the sequence Q,;(x, y) converges uniformly ina 
region having in its interior a point of C, . 

The method of proof of Theorem 12 shows that the left-hand member of (38) 
is less than or equal to the right-hand member. This relation cannot be less 
than, by Theorems 1, 6, and 10. Thus (35) is established, and (36) can be 
established similarly. ; 

The last part of the Corollary follows from Theorem 1 and Theorem 9. 

The conclusions and methods of proof of both Theorem 12 and the Corollary 
extend to iterated integrals. 


9. Related sequences of polynomials in z. We have repeatedly indicated 
the analogy between maximal convergence of harmonic polynomials and maxi- 
mal convergence of polynomials in z. We undertake now to study the relation 
in more detail, as concerns the obtaining of one kind of maximal sequence from 


the other. 
THreorEM 13. Let C be a closed limited point set consisting of the mutually 
exterior closed Jordan regions J;, Jo,---,J,%. Let the sequence of polynomials 


Pn(z) = p»(x, y) + iqn(x, y) of respective degrees n converge maximally on C to the 
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function f(z) = u(z, y) + tv(z, y). Let f(z) be single-valued and analytic through- 
out the interior of C, but not throughout the interior of any C,:, p’ > p. Then we 
have 


lim {max || u(x, y) — pa(z, y) |, (2, y) on C)}"" = 1/p, 


ne 


OT) ae Gaede ag + ale 2) |; Ge p) on. Cll" = 1/- 


ne 
We may even write for every j 


lim {max [| u(x, y) his p(x, y) |, (z, y) on Jil} 1/p, 


neo 


38 pel 
ge ee Saas Gee Si Yn 


no 


Neither of the sequences p,(x, y) and qn(x, y) converges uniformly in a region con- 
taining in its interior a point of C, . 

In particular if u(x, y) [or v(x, y)] is single-valued and harmonic throughout the 
interior of no C, , p’ > p, and this situation always occurs if k = 1, the sequence 
pa(2, y) [or qn(x, y)] converges maximally to u(x, y) [or v(x, y)] on C. 

The sequence p,(z) converges uniformly in no region containing in its interior a 
point of C, [op. cit., p. 83]. If pa(z, y) [or q,[z, y)] were to converge uniformly in 
such a region, so also would q,(z, y) [or pa(z, y)], by Theorem 2, and so would 
p,(z). Thus neither of the sequences p,(z, y) and q,(z, y) converges uniformly 
in such a region. 

From the relation of maximal convergence 


lim {max [| f(z) — pa(z) |, zon C]}"” = 1/p 


and the obvious relations 
| u(z, y) bas Pal, y) | S | f(z) Jute Dn(2) | ’ | v(x, y) ag Qn(2, y) | Ss | f(z) 7 Pn(z) |, 


it follows that the left-hand members in (37) are less than or equal to the right- 
handmembers. From Theorem 4 it follows that neither of these relations can be 
less than. Thus (37) is established. Equations (38) are a consequence of 
Theorem 10. 

If f(z) has a singularity on C, , so also have both u(z, y) and o(z, y). If either 
of the functions u(z, y) and v(z, y) has a singularity on C, , so also has the other, 
and likewise the function f(z). The last part of Theorem 13 now follows, and 
the proof is complete. 

It may readily occur in Theorem 13 that u(z, y) is single-valued and harmonic 
throughout the interior of no C,», p’ > p, whereas v(z, y) does not possess that 
Property; compare the illustration given under Theorem 11. But at least one 
of the functions u(x, y) and v(x, y) must possess that property. 
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Corotiary. Under the hypothesis of Theorem 13, equation (30) is valid. 
Indeed we have for every m 
Wn y 
, (x, y) on J} =<, 


lim {max | 
no : p 


The sequence d°**p,/ax' dy’ can converge uniformly in no region containing in its 
interior a point of C, . 

The proof of the Corollary is similar to the discussion of §7, and is left to the 
reader. 

Under the hypothesis of Theorem 13, the sequences investigated in §8 can be 
studied by methods already developed in detail. The conclusions are precisely 
those of §8; both formulation and proof of the new results are left to the reader. 

TureoreM 14. Let the sequence of harmonic polynomials p,(x, y) converge 
maximally to the function u(x, y) in the closed Jordan region C. Let (a, b) bea 
fixed point interior to C. Let v(x, y) and qn(x, y) be conjugate to u(x, y) and 
Pr(x, y) respectively in C, where 

lim | v(a, b) — qa(a, b) |" < 1/p. 


nec 


atu a ap, 


ax'day? dx‘ dy’ 











Then the sequences q,(x, y) and p,»(z) = pn(x, y) + ign(x, y) converge maximally 
to v(x, y) and f(z) = u(z, y) + iv(z, y) respectively in C. 

Since u(x, y) has a singularity on C,, so also have v(x, y) and f(z). The 
conclusion of Theorem 14 follows from Theorem 2, and from Theorem 6 applied 
to the sequence qn(z, y). 

Theorem 14 does not extend without further qualifications to the case of a 
point set C which consists of several mutually exterior closed Jordan regions, 
as we propose to show by the special example already used in connection with 
Theorem 12. In the notation of §8, we set v(—1, 0) = v(1, 0) = 0, ¢,(—1, 0) = 
qn(1, 0) = 0; this choice is allowable, for the Jacobi series is found by inter- 
polation in the points (—1, 0) and (+1, 0) to the function u(z, y) + iv(2, 9). 
Let us denote the closed interior of the left-hand oval of the lemniscate | z’ — 1 | 
= 1/4 by J; and the closed interior of the right-hand oval by Jz. Let us set 
(a1, bi) = (—1, 0), (a2, be) = (1, 0), Vi(a;, b;) = Qui(ai, b;) = 0. The function 
f(z) is real on the axis of reals, so the coefficients in the Jacobi development of 
f(z) are real. Thus we have q,(z, 0) = 0, 


(1,0) 
(qn(z, y) dx + p(x, y) dy) ag 0, 


—1,0 


since the integral is independent of the path. Then the polynomial Qn;(2, y) 
ts independent of j. Similarly we have 


(1,0) 
i na (v(x, y)dx + u(z, y)dy) = 0, 


where the integral is taken along the axis of reals, and the function V;(2, y) 
is independent of j. It follows from the Corollary to Theorem 12 that the 
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sequence Q,;(z, y) = Qn(z, y) converges maximally to V,(z, y) = V(z, y). 
On the other hand, it is impossible [§8] to determine polynomials —P,(z, y) 
conjugate to the polynomials Q,(z, y) so that we have simultaneous convergence 
even of the two sequences —P,(—1, 0) and —P,(1, 0). 

An obvious extension of Theorem 14 exists if C consists now of mutually 
exterior closed Jordan regions J; , J2, --- , Jz, provided the functions q,;(zx, y) 
and v,(x, y) conjugate to p,(z, y) and u(z, y) in J; are allowed to depend on j; 
compare the Corollary to Theorem 12. The situation is somewhat simpler with 
derivatives: © 

TurorEM 15. Let C be a closed limited point set consisting of the mutually 
exterior closed Jordan regions J;, J2,---,Jx. Let the sequence of harmonic 
polynomials pn(x, y) of respective degrees n converge maximally to u(x, y) on C, 
let the functions qn(x, y) and v(x, y) be conjugate to p,(x, y) and u(x, y) respectively 
on C, and let us set Pu(2) = Pn(x, y) + in(a, y), f(z) = u(x, y) + (x,y). Then 
we have for every j, and form = 1, 2,---, 


lim {max [| f(z)-— p&”(z) |, zon C]}" 


ne 


1/p, 


lim {max [| f(z) — px” (z) |, zon J,)}"" = 1/p. 


neo 


The sequence p',””(z) (n = 0, 1, 2, --- ) converges uniformly in no region containing 


in its interior a point of C,. In particular if f(z) is single-valued and analytic 
throughout the interior of no C, ,.p’ > p (this situation always occurs if k = 1), 
then the sequence p” (z) converges maximally to f(z) on C. 

Theorem 15 follows at once from the methods of §7. 


THEOREM 16. Under the hypothesis of Theorem 12 let us set 


P,(z) = P,(x, y) + 2Q,(2, y), F(z) = U(z, y) + iV(z, y). 
If we have 
lim | F(a + ib) — P,(a + 2b) |"" < 1/p, 


then the sequence P,(z) converges maximally to F(z) on C. 

Theorem 16 follows from Theorem 12, and a similar result can be formulated 
from the Corollary to Theorem 12 for the case that C consists of a finite number 
of mutually exterior closed Jordan regions. Both Theorem 16 and this similar 
result extend to higher repeated integrals of the function f(z) = u(z, y) + iv(z, y). 


10. More general point sets C. Maximal convergence for sequences of 
polynomials in z is defined [op. cit., p. 80] for an arbitrary point set C whose 
complement K is connected, and is regular in the sense that Green’s function 
exists for K with pole at infinity. We have hitherto (§§6-9) considered as 
point sets C on which maximal convergence of harmonic polynomials is defined, a 
lore restricted class, namely closed limited sets each consisting of a finite number 
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of mutually exterior closed Jordan regions. This restriction in the study of 
harmonic polynomials is partly from necessity, partly for the sake of convenience. 

For instance it is not desirable to define maximal convergence of a sequence of 
harmonic polynomials p,(z, y) of respective degrees n on a segment C of the 
axis of reals. Such a sequence may converge uniformly on C to a function 
u(x, y) in such a way that we have 


lim {max || u(x, y) — pa(zx, y)|, (x, y) on C]}"” = 0, 


and yet converge at no point not on the axis of reals. This phenomenon occurs 
if we set for instance u(z, y) = 0 on C, po(z, y) = 0, pa(z, y) = nly for n = 1. 
A complete set of harmonic polynomials {r” cos n@, r” sin n6}, where polar co- 
ordinates are used, is not linearly independent on C. A second disadvantage of 
the use of a segment of the axis of reals as C is that a harmonic function whose 
values are given on C is not uniquely defined by those values. The addition to 
such a function of a polynomial 2.1 a,r” sin n@ does not change the values on C 
of the function or its harmonic character in any region of the plane. 

However, it is by no means essential in the theory thus far developed to 
require that C should be composed of only Jordan regions. Let J be an arbitrary 
closed limited simply connected region, which therefore separates the plane 
into a number of regions. Denote by K that one of those regions which is 
infinite, let G(z, y) be Green’s function for K with pole at infinity, and let J; 
denote generically the locus G(z, y) = log R > Oin K. If R’ > 1 is given, there 
exists [op. cit., p. 28, Theorem 2; the result is essentially due to Carathéodory] a 
closed Jordan region J’ such that J lies interior to J x’. It follows [op. cit., 
p. 81, Corollary] that the inequality for every J’ 


| f(z) — pa(z)| S M/R", zon J’, 


where p,(z) is a polynomial of degree n, where Ri < R is given, and where M 
may depend on J’, implies the inequality 


| f(z) — pa(z) | S Mi/Ri, zon Jaye, or zon J, 


where M, is suitably chosen, and where J is the complement of K. We mention 
explicitly that J may contain points exterior to J. 

More generally, let C be a closed limited point set whose complement X is 
connected and regular, let simply connected regions J; , J2, --- , Jx belong toC; 
let C have the property that corresponding to an arbitrary R’ there exist Jordan 
regions Jj, Jz, --- , Jz interior to J 1, Jo, +++ , Jz respectively such that every 
point of C lies interior to at least one of the curves [Ji]e’, [J 4 , [Jile- 
Let the concept of maximal convergence (§6) be extended to include such a set ¢. 
Then the entire discussion of §§4—9 so far as concerns a set C consisting of k mutually 
exterior closed Jordan regions applies with only obvious modification in the present 
more general case. The entire discussion of §§4-9 so far as concerns 4 single 
Jordan region applies with only obvious modifications to an arbitrary closed limited 
simply-connected region J. But in such a situation as the last part of Theorem 4, 
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we need now to assume u(z, y) harmonic not merely on J but also throughout 
the complement of K. The complement of K may contain points exterior to J, 
points which are interior to every Jr. 

Hitherto we have required that C should consist of a finite number of closed 
Jordan regions, instead of admitting more general regions, merely for the sake of 
simplicity. We shall continue to make that requirement for the same reason. 
But the succeeding material except in obvious cases extends of itself to include 
the more general point sets defined above. Even that extension does not 
completely exhaust the range of the methods used [compare op. cit., §§5.6 
and 5.7]. 


11. Various measures of approximation. Measures of approximation other 
than that used in (10) are of interest, and are now to be studied. As a matter of 
convenience we prove first several lemmas. 

Lemma IV. Let S bea finite region bounded by a Jordan curve T possessing con- 
tinuous curvature, and let S, be an arbitrary closed subregion of S. There exists a 
number N’ depending only on S and 8, , such that if U(x, y) is harmonic interior to S 
and continuous in the corresponding closed region, and if we set 


(39) i 
r 


then we have 





U(a, y) |?ds = 7”, p>, 


(40) | U(x, y) | S N’n, (z, y)in S:. 
The function U(z, y) is represented interior to S by Green’s formula 
(41) U(a, y) = J U(a, B) at d8 (4.8) ’ 
2r Jr dv 


where G(x, y; a, 8) is Green’s function for S with pole in the point (zx, y) and 
running coérdinates (a, 8). The normal derivative aG/dv is continuous on I. 
We apply Hélder’s inequality 


frm|s(fir) (Jim) cea 


to the integral in the right-hand member of (41), with F* = U, H”* = aG/a», 
a=1/p. Inequality (40) follows at once from (39) and (41). 

Lemma V. Let S be an arbitrary limited simply connected region, and let S, 
be an arbitrary closed subregion of S. There exists a constant N” depending only 
on Sand 8 such that if U(x, y) is harmonic interior to S and if we set 


(43) [ [ive y)|?dS =7", p>l, 


then we have 


(42) 








(44) |U(x,y)|<$.N"n, (2, y) in Ss. 























346 J. L. WALSH 


Let Q denote an arbitrary circle interior to S, and let r and (x, y) be re- 
spectively the radius and center of @. Gauss’s mean value theorem yields 


by an integration 


1 
U(xo, Yo) = 7 If U(z, y) dS. 
Inequality (42) yields 


1 ‘. 1/p 
| U(x, yo) | S (rr®)il> (| [ | UC, 9) | ds) 


This right-hand member is by (48) not greater than n/ (ar), If5 > Ois less 
than the minimum distance from S, to the boundary of S, inequality (44) is valid 
with N” = 1/(r8°)'””. 

Lemma VI. Let S be a closed limited simply connected region, and let S, be an 
arbitrary closed subregion of S. Then there exists a constant N’’’ depending only 
on S and S, such that if U(x, y) is harmonic in S, if S is mapped conformally onto 
the interior of the unit circle y :| w| = 1, and if we set 


(45) i LUG abl ate PRE 
7 


then we have 
(46) |U(@,y)| SN’, (2, y)inS,. 


Let the functions w = #(z), z = W(w) (assumed fixed) map the interior of § 
onto the interior of y. Then lim,..:,,<1 ¥(re”) exists for almost all values of 0; a 
value 6 = 6’ for which this limit exists corresponds to an accessible boundary 
point of S, and the radius @ = 6’ of y corresponds to a Jordan arc abutting on the 
boundary of S in ¥(e"’). The limit of U(z, y) along this are is the value in 
the end-point. Thus the integrand in (45) is defined almost everywhere on 7; 
the integral exists, for the integrand on 7 is the limit as r approaches unity of the 
continuous function defined on the circle | w|= r < 1 as the transform of 
U(x, y). The integral in (45) (and in similar situations) is intended to be taken 
in the sense just suggested, namely that limiting values of U(z, y) are to be 
used on y. 

Poisson’s integral for U(z, y) is valid for the interior of the circle | w | =r <1, 
hence for the interior of y. Inequality (46) follows as in the proof of Lemma IV. 

Lemma IV-VI yield results on degree of convergence analogous to Theorem 4: 

THEOREM 17. Let C be a closed limited point set consisting of the closed interiors 
of mutually exterior Jordan curves T;, T2,---, Tx, each possessing continuous 
curvature. Let the continuous function u(x, y) be defined merely on the curves lj. 
A necessary and sufficient condition that the harmonic polynomials p,(2, y) f 
respective degrees n satisfy the relation 


we k 1/np 1 
(47) lim ‘Z| | u(x, y) — pale, y) Pas} a= Gh erk 
gl JT; if 
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is that we have 


(48) lim {max | u(x, y) — pa(x, y) |, (x, y) on C}"" = -. 

In the first part of Theorem 4 it is a matter of indifference whether the given 
function u(x, y) is defined at all points of C, or merely on the boundary of C, 
for the given sequence p,(x, y) obviously converges uniformly on the boundary 
of C, hence uniformly on C; the function u(x, y) is obviously harmonic in the 
interior points of C, continuous on C. In Theorem 17 we assume u(z, y) defined 
merely on the curves I’; , SO we are not yet in a position to apply Lemma IV. 

If (47) is given, we may write 


M 
(49) I | u(z,y) — paz, y) "ds Som, 1<n<s, 
rj 1 
where M is suitably chosen. The general inequality 
(50) [xi + x2? $2?" E[xal? +] xe!"], p>], 
implies 


M 
[ inne, y) — Pra(a, y) |"ds S ae 
whence by Lemma IV 
. . M2 
(51) | pr(x, y) — pnalz, y)| S 7? (x, y) on S;, 


where S; is an arbitrary closed set interior to [T;. From the first part of 
Theorem 4 it now follows that the sequence p,(x, y) converges throughout the 
interior of [S;],, , uniformly on any closed set interior to [S,],, , to some function 
U(z, y). Hence U(z, y) is defined and harmonic at all points interior to [S,],, , 
in particular [by the arbitrariness of S; interior to I’; ; see op. cit., p. 28] at all 
points on or within T;. Moreover we have by Theorem 5 


(52) |U(x,y) — plz, y)| S =, (x,y) on Tj, 


where 72 < 7; is arbitrary, and where M; (depending on 72) is suitably chosen. 
Inequality (52) yields 


4 


M 
[ | U(2, y) es prlk, y) |? ds ~ np) 
Tj T2 


which together with (49) implies by virtue of (50) 


1; 


A 
[ | U(x, y) — u(z, y) |’ds S =p. 
Tj T2 


The functions U(z, y) and u(z, y), being both continuous on IT; , are therefore 
identical, and it is allowable to denote by u(x, y) the limit of p,(z, y) on and 
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within T';. It is essential in the first part of Theorem 17 to require that u(z, y) 

be continuous on I; if we are to conclude the identity on I’; of u(z, y) and the 

limit of the sequence p,(x, y); otherwise we can conclude only the equality of 

U(z, y) and u(x, y) almost everywhere on Tj. This situation is to be expected, 

for (47) involves the values of u(x, y) only almost everywhere on I; . 
Inequality (52) implies the inequality 


lim {max [| u(z, y) ne ple, y) I, (x, y) on C] Tate = 1/r 


ne 


so the left-hand member of (48) is less than or equal to the right-hand member. 
But the relation less than would imply the relation less than instead of equality 
in (47). Thus equation (48) is established. 

The converse proposition follows at once, namely that (48) implies (47). 
Equation (48) implies that the first member of (47) is less than or equal to the 
second member. But the relation less than would by the part of Theorem 17 
already proved imply the relation less than instead of equality in (48), contrary to 
hypothesis. 

An immediate consequence of Theorem 17 is the 

Corotuary. Let C satisfy the conditions of Theorem 17. Let the function 
u(x, y) be single-valued and harmonic throughout the interior of C, , but not through- 
out the interior of any C,, p’ > p. A necessary and sufficient condition that the 
sequence of harmonic polynomials p,(x, y) of respective degrees n converge maximally 
to u(x, y) on C is 


k 
(53) lim iz 3 | u(x, y) — pala, y) |? as) 
nO \7= i 
There exists no sequence p,(x, y) such that the left-hand member of (53) is less 
than 1/p. 


The proofs of the following two theorems and corollaries are so similar to the 
proofs just given that they are left to the reader. 

TuEorEeM 18. Let C be a closed limited point set consisting of the mutually 
exterior closed Jordan regions Ji, J2,---,J%. Let the continuous function 
u(x, y) be defined merely on C. A necessary and sufficient condition that the 
harmonic polynomials p,(x, y) of respective degrees n satisfy the relation 


—" 1/np 
lim ff | u(x, y) — pala, y) ras} = : <.T, p> 1, 
Cc 


no 


is equation (48). 

Corotiary. Let C satisfy the conditions of Theorem 18. Let the function 
u(x, y) be single-valued and harmonic throughout the interior of C,, , but not through- 
out the interior of any C,, p' > p. A necessary and sufficient condition that the 
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sequence of harmonic polynomials p,(x, y) of respective degrees n converge maximally 
to u(x, y) on C is 


(54) lim ff | u(x, y) — pala, y) ras} a - 


There exists no sequence p,(x, y) such that the left-hand member of (54) is less 
than 1/p. 

TurorEM 19. Let C be a closed limited point set consisting of the mutually 
exterior closed Jordan regions J,, J2,---,J%. Let the region J; be mapped 
onto the interior of y; :|w;| = 1 by a fixed transformation. Let the continuous 
function u(x, y) be defined a) on the boundary of C or b) on the closed set C. A 
necessary and sufficient condition that the harmonic polynomials p,(x, y) of respec- 
tive degrees n satisfy the relation 


k 1/np 
pe 1 
a) lim »2 | w(x, y) — pr(ax, y) |? | dw; } --<1, p>t, 
neo j=l Jj 
orb) lim > If | | u(x, y) — pa(x, y) ras} =-<l,. pol, 
neo \j=1 jwj|S1 


is that equation (48) be fulfilled. 

CoroLttaRy. Let C satisfy the conditions of Theorem 19, and let the function 
u(x, y) be single-valued and harmonic throughout the interior of C, but not through- 
out the interior of any C,, p’ > p. Equivalent necessary and sufficient conditions 
that the sequence of harmonic polynomials p,(x, y) of respective degrees n converge 
maximally to u(x, y) on C are 


k 1/np 
1 
lim iz | u(x, y) oa Prl2, y) |? | dw; I = p’ Pp > 1, 
neo 17= s 


Yi 


Pee k 1/np 1 
lim > If | lute, y) — Pal, y) ras} sail ghd > 1. 
oT a wild 


There exists no sequence pn(x, y) such that the left-hand member of either equation 
in (55) is less than 1/p. 


(55) 


12. Best approximation. Let the point set C consist of mutually exterior 
closed Jordan regions J; , Je, --- , Jz, and let the function u(z, y) be harmonic 
on C. The approximation to u(x, y) on C of a harmonic polynomial p,(z, y) 
can be measured in any one of a number of ways, such as: 


) max [n(z, y) | u(x, y) — pala, y) |, (x, y) on Cl]; 


‘ k 
ii) : ‘ n(x, y) | u(x, y) — pa(x, y) |? ds, p>, 


7=1 i 
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where each region J ; is assumed to be bounded by a curve 1; which has continuous 
curvature ; 


iii) | I n(x, y)| u(x, y) — pr(z,y)|?dS, p>, 


iv) 2. n;(w;) | u(z, y) Ts Prl2, y) |” | dw; |; p> 1, 


J FV4 


where the interior of J ; is mapped onto the interior of y; : | w;| = 1; 


”) Df mead inlay) — pales y) dS, p>, 


j=l 
where the interior of J; is mapped onto the interior of y;:|w;| = 1. 


The norm function n(z, y) or n;(w;) is for the present to be considered positive 
and continuous where defined. When 7 is given, there always exists at least one 
harmonic polynomial of degree n of best approximation, that is to say, a poly- 
nomial of degree n for which the measure of approximation is at least as small as 
for any other polynomial of degree n. In case i), the polynomial of best approxi- 
mation may or may not be unique; in cases ii)-v) the polynomial of best 
approximation is always unique. The latter remark follows (this is the usual 
proof) from a detailed use of inequality (50): if two distinct polynomials of 
degree n are given, which have the same approximation to u(x, y) on C, half 
their sum is actually a polynomial of degree n of better approximation, and 
differs from the two given polynomials. 

The measures of approximation iv) and v) naturally depend on the particular 
conformal map, not uniquely determined, of the interior of J; onto the interior 
of y;. But each of these measures of approximation with given map and given 
norm function n,(w,), is equivalent to the corresponding measure of approxima- 
tion with an arbitrary map and a suitable norm function. 

In the following theorem, any polynomial of best approximation will suffice if 
such a polynomial is not unique: 

Turorem 20. Let C be a closed limited point set consisting of the mutually 
exterior closed Jordan regions J:, J2,-+-, Ju. Let u(x, y) be harmonic on C, 
and let I1,(x, y) be the harmonic polynomial of degree n of best approximation to 
u(x, y) on C, in any of the senses i)-v). Then the sequence I1,(x, y) converges 
maximally to u(x, y) on C. 

Case iii) is typical; the reader will easily make the appropriate modification 
for the other cases. Suppose we have 


(56) 0 <M < n(z, y) < Ne, (x, y) on C, 


where N; and Ne are constants. If p,(z, y) denotes a sequence of harmonic 
polynomials converging maximally to u(x, y) on C, we have from (54) and from 
the last of inequalities (56), 


lim ff n(x, y)| u(x, y) — pala, y) ras} = -. 
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For the polynomials II,(2, y) of best approximation we have a fortiori 


I/np 


fim { { fm, w)|w(e,y) — hte, y) Pas} ” 


1 
ayer / p 


From the second of inequalities (56) we may write 


1/np 
jim if [iwc ag ag u)/ras} of 
n7e 4 


From the Corollary to Theorem 18 it follows that the inequality here is im- 
possible, and that the sequence II,(z, y) converges maximally to u(z, y) on C. 

For k = 1, n(z, y) = 1 or n,j(w;) = 1, C the closed interior of a circle, 
method ii) reduces essentially to trigonometric approximation on the circum- 
ference, a topic which has been recently studied by Lebesgue, de la Vallée 
Poussin, S. Bernstein, Jackson, and others. 

The measure of approximation i) is that of Tchebycheff, studied by Walsh and 
Julia in the case k = 1, n(x, y) = 1; and by Faber in the case k = 1, n(x, y) = 1, 
for approximation by polynomials in z. The measure ii) has been used in the 
case k = 1, p = 2, n(x, y) = 1 by S. Bernstein, Brioullin, Picone, and Merri- 
man,” and has been used by Szegé and Lavrentieff in the analogous situation for 
polynomials in the complex variable. The measure iii) has been used in the 
case k = 1, n(x, y) = 1 by Zaremba”™ and Bergmann, and has been used by 
Carleman and Bochner in the analogous field of the complex variable. The 
measure iv) was used by Julia for harmonic polynomials in the case k = 1, 
n{w;) = 1. The most essential part of Theorem 20 for the case k = 1, but 
without the concept and detailed properties of maximal convergence, was 
proved in two papers by the present writer; further detailed references to the 
literature are given there. Theorem 20 in its present form is somewhat similar 
to, but more specific than, results obtained by Walsh and Russell’ for approxima- 
tion by polynomials in the complex variable on a set C not necessarily connected. 

In case i) with n(x, y) = 1, it is immaterial whether approximation is measured 
on C or on the boundary of C. Consequently we have here a theoretical method 
for the solution of the Dirichlet problem. The polynomials II,(z, y) can be 
determined merely from the values of the function u(z, y) on the boundary of C. 
The sequence II,,(x, y) converges uniformly on the closed set C and thus defines 
u(z, y) interior to C."° A similar remark applies to both ii) and iv). 





* American Journal of Mathematics, vol. 53 (1931), pp. 589-596. 

* Bulletin de l’ Académie de Cracovie, 1909. 

“Trans. Amer. Math. Soc., vol. 32 (1930), pp. 794-816; vol. 33 (1931), pp. 370-388. 

* Trans. Amer. Math. Soc., vol. 36 (1934), pp. 13-28. 

In cases i), ii), and iv) we have here a theoretical method for the solution of the Dirich- 
let problem even if u(x, y) is merely continuous on C, harmonic interior to C; the poly- 
homials I,(z, y) of best approximation can be determined merely from the values of the 
funetion u(z, y) on the boundary of C; the sequence II,,(z, y) converges interior to C, uni- 
formly on any closed set interior to C (uniformly on C itself in case i)), and thus defines 
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The additional relative advantage of the measure of approximation ii) lies 
in its naturalness and simplicity, and in the fact that for p = 2 the formal 
expansion is one in orthogonal functions. The disadvantage of it (in comparison 
with i), iii), iv), and v) lies in the heavy restrictions on the regions to which it 
applies. The advantage of iv) is that it applies to arbitrary simply connected 
regions, and involves the values of u(z, y) only on the boundary of those regions 
(or on the circumference y;). 


13. Extensions. We have already indicated in §10 that our results apply toa 
point set C much more general than that composed of a finite number of Jordan 
regions. The remarks of §10 apply at once to the results of §§11 and 12. But 
in the measure of approximation ii), it is to be noted that the boundary of C must 
be at least rectifiable for the integral to have a meaning; in addition, the con- 
tinuity or some other requirement concerning dG/dv on the boundary of C is 
essential for the reasoning as given. 

In Theorem 20 we have assumed for convenience that the norm functions 
n(x, y) and n;(w;) are positive and continuous. It is sufficient, however, in 
case i) if the norm function is bounded and bounded from zero. It is sufficient 
in cases ii)-v) [method of op. cit., §5.7] if the norm function is integrable, 
together with its (—8)*® power, where B = 1/(p — 1). 

In §§11 and 12 we have studied approximation only in the case p > 1. The 
case p = 1 can also be treated by similar methods and yields corresponding 
results. But the case 0 < p < 1 is quite different. We are not in a position to 
show the failure of Theorem 20 in the latter case, but shall now show by an 
example the failure of LemmalIV. Let us define the continuous function U,,(r, 4) 
on the unit circle [ :r = 1 by the requirements 


U,(1, —7z) = 0, U,(1, —2/n) = 0, U,(1, 0) = 
U,(1, r/n) = 0, U,(1, x) = 0 


where U,(1, 6) is a linear function of @ between the successive values given. 
There exists a function U,(r, 6) harmonic interior to I’, continuous in the 
corresponding closed region S, which coincides with U,(1, 0) on T. We have by 
Gauss’s mean value theorem 


U,(0, 0) = = [ U,(1, 6) do = 





u(x, y) interior to C. This remark follows from the possibility of uniform approximation 
of u(x, y) on C by harmonic polynomials; that is to say, there exists a sequence of harmonic 
polynomials p,(z, y) of respective degrees n such that the measure of approximation i) 
approaches zero as n becomes infinite; measures i), ii), and iv) consequently approach zero 
for the polynomials II,(z, y); we apply Lemma IV and an obvious extension of Lemma VI. 
The remark applies in cases i) and iv) even when the regions J; composing C are not Jordan 
regions, but if the boundary of C is not the boundary of an infinite region, the function 
u(z, y) defined continuously on the boundary of C is then not entirely arbitrary; compar 
Walsh, Crelle’s Journal, vol. 159 (1928), pp. 197-209. 
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We also have (0 < p < 1) 


Fd ® grt. 
ail, 0 rdo = | U1, OP a = ————. 
f [UaCl, do =f 10. oP do = A 
This last expression approaches zero as n becomes infinite, so even for a fixed S, 
containing the origin there exists no N’ such that (40) is fulfilled. 
Let us consider the situation of Theorem 20, case ii). The harmonic poly- 
nomials 


(57) 1, r cos 6, r sin 6, r* cos 26, r° sin 20, --- , r” sin N@ 


are linearly independent on the boundary [of C. For, any linear combination L 
of these polynomials which vanishes on I’, vanishes throughout the interior of C, 
and therefore vanishes identically in the entire plane. In particular L vanishes 
identically on the unit circle y, hence (since the functions 1, cos @, sin 6, --- , 
sin N@ are linearly independent on ) all the coefficients in L are zero. It is 
consequently possible to orthogonalize and normalize the functions (57) on T 
with respect to the positive continuous norm function n(x, y). We denote by 


Po(z, y), Pi(a, y), Pe(x, y), --- 


the polynomials resulting from this process. The harmonic polynomials 
Pon,s(z, y) and Pen(x, y) are both of degree n. 

If u(z, y) is given harmonic on and within C, the harmonic polynomial II,(z, y) 
of degree n of best approximation to u(x, y) on T in the sense of least squares 
(p = 2) is the sum of the first 2n + 1 terms of the series 


kd k 
(58) u(z,y) © LeamPm(2,¥), am = 2 _, ma, y)ula, y)Pm(2, y) ds. 
m= oe i 

The actual formulas for the functions P»(z, y) can be written down at once 
(Merriman, loc. cit.], but asymptotic formulas for those functions have never 
been obtained. Nevertheless, methods developed elsewhere [op. cit., §§6.6 and 
6.7] together with the results of the present paper, enable us to establish certain 
results on the coefficients a, in (58) and on the convergence of the series (58). 

THEOREM 21. In the situation of Theorem 20 case ii), p = 2, let the function 
u(x, y) be single-valued and harmonic throughout the interior of C, , but not through- 
out the interior of any C,-, p’ > p. Let the norm function n(x, y) be positive and 
continuous on the boundary of C. Then the coefficients am in the formal expansion 
(58) of u(x, y) satisfy the relation 


(59) iim | dm pite ia 1/p”, 

Reciprocally, if numbers am are chosen arbitrarily so as to satisfy (59), the 
Junction u(x, y) defined by (58) is single-valued and harmonic throughout the 
mlerior of C, , but not throughout the interior of any C,:, p’ > p. 
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It is of course possible throughout the present paper to consider degree of 
approximation not by harmonic polynomials of respective degrees n, but by 
harmonic polynomials p,.(z, y) which are respectively linear combinations of 
the first m + 1 of the functions 1, r cos 6, r sin 8, r’ cos 20,---. The results in the 
latter case are entirely analogous to the results in the former, and the results in 
either case may be readily established from the results in the other case. But 
in the latter case, the number p which now appears in such a relation as (10) 
is to be replaced by p. 

THroreM 22. Under the hypothesis of the first part of Theorem 21, the series 
(58) can converge at every point of no region exterior to C, . 

Under the conditions of Theorem 22, |.ne segments of uniform convergence of 
the series (58) may be everywhere dense in the plane, as is illustrated by the 
example given in §6. 

The conclusions of Theorems 21 and 22 apply also to the situations of Theorem 
20, cases iii)-v), with p = 2. Details of the proof of Theorems 21 and 22 and of 
these extensions are left to the reader. 


HARVARD UNIVERSITY. 
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TENSOR COORDINATES OF LINEAR SPACES 
By WALLACE GIVENS 
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|. Introduction. Homogeneous codrdinates of linear subspaces of a projec- 
tive space were defined by Grassman and have been studied by Severi,’ Anto- 
nelli,’ and others. These studies have not, however, employed the notation and 
methods of the tensor calculus. In this paper we define contravariant and co- 
variant tensor codérdinates of the linear subspaces of a projective space and 
derive ab initio the quadratic relations which they satisfy. 

These codrdinates enable us to give elegant algebraic expression to the geo- 
metric operation of perspection and section, where the center of perspection is a 
linear space of any number of dimensions. ‘Two expressions are found for the 
coérdinate tensor of the join and intersection of two spaces and in verifying the 
equivalence of these two expressions we are led to a useful identity. 

In §5 we introduce a non-singular quadratic form into the geometry and 
employ the matrix of this form to lower and raise tensor indices in the familiar 
fashion. We then derive some of the properties of the linear spaces which lie 
ona quadric. In particular, we find the algebraic characterization of the two 
families of rulings on a quadric in a complex projective space of 2v — 1 dimensions 
and obtain the intersection properties of these rulings. Certain exceptional 
features of the quadric in 3-space are noted which imply special properties of the 
orthogonal group on four variables. 

As a further application of the tensor codrdinates we obtain a correspondence 
between tensor sets in a complex projective space of 2v — 1 dimensions, P»,_1, 
and collineations in a complex projective space of 2” — 1 dimensions, Py_;. 
This correspondence is then employed to obtain a representation of the rulings 
on the quadric by means of points in two non-intersecting spaces of 2” — 1 
dimensions in P»_,. For v = 3 this leads to the well known Pliicker-Klein 
representation of the points of a quadric in P; by the lines of a 3-space. 

In §15 we obtain a matrix representation of the group of proper orthogonal 
matrices of order 2v by matrices of order 2” for values of v greater than two. 
For odd values of v this representation is shown to be (1-2) and for even values of 
vit is shown to be (2-2). In §16 we give the corresponding theorems for the 
groups of proper orthogonal matrices of orders two and four. 

The author desires to express his sincere gratitude to Professor O. Veblen 
for the stimulus of many discussions on the subject matter of this paper. 


na | Nab 


' Annali di Matematica 24 (3), 1915, pp. 89-120. 
* Annali della R. Scuola normale Sup. di Pisa, Bd. III, 1883, pp. 71-77. 
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2. Definition of the Codérdinate Tensors. An (r — 1)-dimensional linear 
subspace, V, of the real or complex projective space P,_; is determined by any r 
linearly independent points which it contains. If 


Aj, Az, A3, +++, A; (¢ = 1, 2,--. ,n) 
are coordinates of r such points, then the tensor 
tytg**ta 1 iytg*** tr 8 8 8 
vit "= 5 Satan s+ a Aj’ A2* --+ A,’ 


(2.1) 
aad Af® A}? rine ee Air) 


where 6:42 ::: it is the generalized Kronecker delta,’ is said to be a contravariant 
codrdinate tensor of V. Thus the coérdinate tensor of the line determined by 
points A‘ and B‘ is V = 3(A‘B’ — B‘A’), and the components of the codrdi- 
nate tensor of an (r — 1)-dimensional space are (1/r!) times the minors of the 
matrix of r rows and n columns formed by the codrdinates of r independent 
points of the space. 

In computations involving many indices we replace a set of indices such as 
ijt, --- 1, by a single index in parentheses (2). ‘The number of indices in a set (7) 
is denoted by | () | or | 7|, in words, the “length” of (7). Thus (2.1) becomes 


in this notation 
1 


(2.2) vy” - ape A? A;? a< Aji, 
where |s| = |7| = 7. We shall also use symbols such as (k;) to denote sets of 
ordinary indices kak» --- k;, where r = | k; |. 
If we define «;), | 7 | = n, to be +1 if (2) is an even permutation of 
(1, 2, < eies , 2), 


—1 if it is an odd permutation, and zero otherwise, and let ¢(;) be a covari- 
ant tensor of weight —1, then the components of «:;) have the same values in all 
codrdinate systems. This numerically invariant tensor is always present in our 
geometry and we employ it to define a covariant tensor 


(2.3) G@) = my ) eno V® 

associated with V". In this definition p is assumed to be a scalar function 
of the number of indices in the set (4) of weight +1 so that V7) and V“ have the 
same weight, which we shall take to be zero. For many purposes in projective 
geometry it would be sufficient to take p = 1 for all values of | 7 | in an arbi- 
trarily chosen coérdinate system. When we introduce the matrix of a quadratic 
form we will determine the value of p( | 7 | ) so as to give a convenient calculus. 





* For the definition and properties of these symbols see Chapter I of the Cambridge 
Tract No. 24, Invariants of Quadratic Differential Forms, by O. Veblen. 
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Observing that equations (2.3) essentially amount to no more than a renum- 
bering of the components of Vv“, we may invert the equations to get 
24) = Vg 

ale| igi!” 
where « is the numerically invariant contravariant tensor of weight +1 
defined by the equations e = ej. In (2.3) and (2.4) we have given rules for 
lowering and raising sets of skew-symmetric indices, the bar being added or 
removed by the operation. We also use (2.4) to define W? in terms of W,,) 
and then (2.3) gives W;;) in terms of WY’. So far we have no way of defining a 
tensor Vi» associated with V“. In this paper we shall have no occasion to 
raise or lower part of the indices of a tensor by means of €‘” or e). We include 
this possibility in our calculus, however, by extending the bar over only those 
indices which have been lowered or raised. 

The operations of raising and lowering sets of skew-symmetric indices give 
algebraic expression to the law of duality as applied to linear spaces. Thus if 
we choose codrdinates so that Ai = 6i(a = 1, 2, --- , r) in (2.2), Vj) will vanish 
unless (j) is a permutation of ((r + 1), (r + 2), --- ,) and consequently V5) 
is a multiple of 67, 6;3" --- 67,._,;. Hence a set of r independent points in V 
determines V to within a factor and a set of n — r linearly independent hyper- 
planes containing V determines V5) to within a factor. A set of points BL = 
\2Aj will determine the codrdinate tensor AV“, where 2 is the determinant of the 
matrix || 2 ||. Consequently any two sets of r independent points in V deter- 
mine the same pencil of tensors pv. 

Also, the codrdinates X* of a point in V satisfy the linear equations 


(i 


(2.5) Var = 0, 
and the codrdinates of a hyperplane containing V satisfy the equations 
(2.6) v“"xX, = 0. 


Moreover, writing (2.5) in the form eq. Ai' Az? --- A;"X* = 0, we see that 
every solution of (2.5) is linearly dependent on Aj, A2,--- , and A; and so 
determines a point of V. A similar statement holds for (2.6). This completes 
the proof of the theorem 

TuroreM (2.1). A linear space determines its codrdinate tensors to within a 
factor and is wniquely determined by them. 


3. The Quadratic Identities. The codrdinate tensors of a linear space are 
necessarily skew-symmetric and non-vanishing. Not all such tensors, however, 
are expressible in the form (2.1) and consequently a tensor X‘? = X (Ol x 0 
must satisfy additional conditions in order to be a coédrdinate tensor. One form 
of these conditions is given in the 
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TuroreM (3.1). A necessary and sufficient condition that V'! = y® x0 
shall be the coordinate tensor of a linear space 1s that 


rank || V* || = | (k)s |, 
where the set of indices (k) numbers the rows of the matrix and the single index s 


numbers the columns. (Thus || V“* || is rectangular with n'*! rows and n 


columns. ) 
The necessity of the condition follows from observation that equations (2.6) 


have exactly n — r independent solutions so that rank || V“* || = r = | (k)s . 


To prove the sufficiency of the condition we need only choose codrdinates so 
that a set of n — r linearly independent solutions of (2.6) are 6;"7, 6;*?, ..., 


and 67; then V“ will be a multiple of 51°! 63? . -. 657): 

Lemmal. If V = vi! % 0, then 

rank || V“* || 2 | (k)s|. 

If rank || V“* || were less than | (k)s | = r, equations (2.6) would have more 
thant n — rindependent solutions. By a suitable choice of codrdinate system a 
linearly independent set of these solutions could be taken to be 4;, 43, «+ , &, 
wheret >n—r. Then V™! = V%? = VY? =... = V™ = 0 and since r 
indices (7) cannot be chosen different from one another and from all the numbers 
(1, 2,3, --- , 2), we would conclude from the skew-symmetry of V“ that V = 0, 
contrary to hypothesis. 


THEOREM (3.2). If 
XY ns = 0, where X® = X'@ 2 0, Yu = Yu) # 0, 
and the indices range from one to n, then 
(1) || + ]y| Sn, and 


(2) if|x| + |y| =n, then Yq = XG) and X™ and Yw are coérdinate tensors 
of the same linear space. 

For each choice of the indices (7) the vector Y,;), belongs to the right null-space 
of the matrix || X“* || and consequently 


(3.1) rank || Y(,)s || S$ m — rank || X°* ||. 
But by Lemma 1, 
(3.2) |x| < rank || X* || and | y | S rank || Ys ||. 


Combining these inequalities we have the first part of the theorem. Moreover, 
if|x|+|y|= n, the equality signs must hold in (3.2) so that by Theorem 
(3.1) X™ is a codrdinate tensor. Choosing coérdinates so that 


Xx = ray” e.g 
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the hypothesis of the theorem implies that Y¢j1 = Y¢j2 = --- = Yi iz) = Oand 


consequently Y(y) = wbtyr | dy, | 7? +++ 67,18 a multiple of Xg). 


TurorEM (3.3). A necessary and sufficient condition that V = V'! x 0 
shall be a coérdinate tensor 1s that 


(3.3) o'Vin ao ik 


The sufficiency of the condition follows from Theorem (3.2) by putting 
yx = V and Y() = VG. The necessity follows from the observation that 
when V“” is expressible in the form (2), then for each set of values of the indices 
(k) and (j), V“"VGye is a multiple of a determinant in which two rows are 
equal. 


Equations (3.3) obviously imply the vanishing of KY Vite if (s) con- 
tains more than one index. In Theorem (5.4) we shall prove that 


V"'Varn = 0 
implies the apparently stronger condition (3.3). This will prove the 


TuEoREM (3.4). A necessary and sufficient condition that V° = V'! x 0 
shall be a codrdinate tensor is that 


(3.4) V"Vin = 0. 


The quadratic relations satisfied by the components of a coérdinate tensor 
are usually given in the non-tensorial form‘ 


(3.5) Vimy (med + y (macy (mdb 4 V (mad 7 (mbe = 0, 


where the set of indices (m) occurs twice but without being summed. These 
equations are obtained by multiplying (3.3) by «?”, summing, and putting 
(m)a for (k). We shall here employ only the tensor form of the quadratic 
relations, (3.3) or (3.4). 


4. Joins and Intersections. We call the linear space of smallest dimension- 
ality which contains two linear spaces, X and Y, the join of X and Y and denote 
it byX+Y. Similarly, we call the space of greatest dimensionality contained 
in both X and Y the intersection (meet) of X and Y and denote it by XY. 
We write 0 and 1 for the null-set and the entire space, respectively. The codrdi- 
nate tensors for 0 are a scalar and e,;). For 1 they are e”” and a scalar. 


THEOREM (4.1). If X, Y and J = X + Y are linear spaces with the respective 
‘ordinate tensors x® y™ nial J®, and XY = 0, then Ja te proportional to 
oer and oO XHGY?. If XY # 0, and|z| + |y| Bm, Yoox” = 

(i)(y) = 0. 

—————_. 


‘ Bertini, Einfiihrung in die Projektive Geometrie Mehrdimensionaler Réume, Vienna, 
4, p. 43, formula (14). 


~yee 
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If X is the join of the | z| points Ai, Ae, --- , Ajz; and Y is the join of the 
| y | points Bi, Be, oY , By,;, then 


J=A,+Aet--- + Ape, + Bit B+--- + By, and XY =0 
implies that these | z | + | y | points are linearly independent. Consequently, 
JG = €@@wm Al’ AY +--+ Ajz] Br By --- Biz 


is different from zero and is by definition a covariant coédrdinate tensor of J. 
Using the definition of X® and Y™, 


ee yw) _ jz|lyl (y) y (2) 
JH= Ee @mX Y" = (1) Eye) YU X™, 


so that J) is proportional to YamxX ® and to XmmY™. If XY ¥0 the 
points Ai, Ae,---,A;z, Bi, Be,---,Byy; are linearly dependent and the 
expressions vanish. 

A similar proof gives the dual theorem. 

TueroreM (4.2). If X + Y = 1 and XY = I, then I is proportional to 
XY and to YONG. If X + Y # 1, and|x|+|y| 2 2, then 


> aes go i yore” = 0. 


TueoreM (4.3). If X and Y are linear spaces with codrdinate tensors 7 
and Y™, respectively, then for each value of |s| S|y|and = n — |x| the 
“projection operator” p\;} = Y XGycm defines a transformation 


(4.1) o = pi 


of linear subspaces of dimensionn — |x| —|s|—1 into linear subspaces of 
dimension | y| —|s|—1. The transformation has the following properties: 

1. If y” is the codrdinate tensor of a linear space w such that Xy = 0 and 
Y+X+y=1, theng(# 0) is a codrdinate tensor of the linear space Y(X + ¥). 

2. If either or both of the conditions Xp = 0 and Y + X + wp = 1 fail to hold, 
then go? = 0. 

The theorem follows immediately by applying Theorem (4.1) to the spaces 
X and y and then Theorem (4.2) to the spaces Y and (X + y). Dualizing 
Theorem (4.3) gives 


THEOREM (4.4). Yap{j} is a codrdinate tensor of WY + X unless y+ Y #1 
orW YX ¥ 0 and then it is zero. 


TuroreM (4.5). If X® and Y™ are coérdinate tensors of linear spaces and we 
form the sequence of tensors 
@y- ylides (i)(¢2) y¥——— 
YO XH, VO" XG YO XG °° 


(4.2) (ia)(ea) y = (sa) 
Tee tinted’ ORO BT tat Pree e 
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where | 8 | = 7 and a is the smaller of the two numbers n— |x| and|y|, then 
the last non-vanishing tensor factors into the form 


(4.3) IJ, 


where I” and J;;,) are codrdinate tensors of I = XY and J = X + Y, respectively. 

The spaces X and Y intersect in a space of dimensionality < |x| — 1 and 
<|y|—1. Hence if I“ is a codrdinate tensor of I = XY we may put | i | = 
\y| —-rwherer 20. Now applying Theorem (4.3), the expression 


(ip) (ip) lop) yee 5) 
(4.4) gn = Xan” 


is either zero or is the coérdinate tensor of g = Y(X + y). In the latter case 
includes XY and its dimension, | 7, | — 1 = | y| — r — 1, equals the dimension 
of XY. Hence y = XY and gy” = J“”. Under our assumptions a space y 
can be found satisfying the conditions XY = 0 and Y + X + y = 1 and for 
this space y"” # 0 by Theorem (4.3). 

It is now convenient to regard (4.4) as a vector equation g = My where M is 
the rectangular matrix || Y“”°”XGpa=, ||. Choosing as a basis for the set of 
all vectors y the coérdinate tensors (7), where | k | = | j, | and (p) is any set 
of |k | numbers between one and n, the matrix M transforms every vector of 
the basis either into the zero vector or into a multiple of I“”. Since M is not 
zero, it is of rank one and therefore factors into the product of two vectors. 
That is, 


(4.5) y (ir) Gr) Xuan me J Joes 


where J” is a codrdinate tensor of XY. Applying Theorem (4.4) to this 
projection operator proves that J;;,) is a coérdinate tensor of X + Y. 

Summing off r + 1 indices of Y™ against r + 1 indices of Xj) gives zero since 
each component of this tensor is a sum of terms of the form P’Q;, where P is a 
point of J = XY and Q is a hyperplane containing J = X + Y and a fortiori 
containing P. It is obvious that if any one of the terms of (4.2) vanishes, then 
all following terms also vanish. 





If we exchange X and Y in (4.2) we get the sequence 


(Owe witha wade PS 
XY G, X Yipes ee VGaptes) 9 °° 


(4.6) ; 
See yt Oe Ae gh eee 
where | s,| = r and a is the smaller of the two numbers n — |y|and | z|. 


Itis to be observed that the number of terms in the sequences (4.2) and (4.6) are, 
respectively n —|2|+landn—|y|+1lifns|2|+|y|,and|y|+1 
ind|z|+1ifn >|2|+4 |y|. Hence the sequences are in general of unequal 
length. For codrdinate tensors X and Y™, Theorem (4.5) implies the 


| THEOREM (4.6). If X@ and Y™ are any two skew-symmetric tensors, then the 
ast non-vanishing terms in (4.2) and (4.6) are proportional. 
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To prove this theorem without restricting X ® and Y™ to be coordinate 
tensors we use the definition (2.3) and its inverse (2.4) to express Y 5) and X” 
in terms of Y” and X(q), respectively. This gives 


(i) (r) — (i) (m) wy=- 
xe Yom = Mag YX) 


(4.7) i(s —e i fo-e? aig 8 
= UY‘ . Xe + deb; ee — UXis vee Gtgt)(s1) + 


+ additional terms in which more than | s| + 1 indices are summed, where 
| s: | = | s| + 1 and A, Ay, and 2g are scalars which depend on the lengths of the 
various indices. Consequently, if Y““’ Xtc is the last non-vanishing term in 
(4.2), : 


(4.8) 2 OVD = MYO°°XRR@, 
where |r| =|s|—|y|+|2]|, and hence X°” Yq) must be the last non- 


vanishing term in (4.6). 


5. The Quadratic Form. We now introduce into our geometry a funda- 
mental quadratic form 7;;X'X’ and use the symmetric matrix || 7;; || to lower 
single tensor indices by the familiar rule, X; = yi;X 7, Covariant indices are 
raised by the rule Xv’ = X’, where y'"yx; = 4}. 

If we lower each of the indices of a skew-symmetric tensor X” by y;; the 
resulting tensor is skew-symmetric, and is denoted by X,;,._ Applying the rule 


A 


(2.4) to raise the indices (7) by means of gives the tensor X“. 


1 
p(lz|) ||! 
Alternatively, we could first lower the indices of X“ by the rule (2.3) to get X (i) 
and then raise them with y*’. The result of these two processes is the same to 
within a factor. Indeed, by using (2.3), (2.4), and the formula for the deter- 


minant of the matrix || y’’ ||, we find that 


(5.1) ORS = [o(1t]) on — [4 |) v1) OTP] Xe, 


(s) (4) 8171, Soi Siit Tal 


where I teliae See 7 and y is the determinant | ¥;; |. 
The scalar factor in the right member of (5.1) will be +1 if we put 


(5.2) o( |i] ) = y(—1yleot??, 


This choice has the disadvantage that p(|7i|) is sometimes imaginary. This 
cannot always be avoided without introducing additional complications into the 


calculus for in the special case in which n = 2v and | 7 | = », (5.1) reduces to 
(5.3) OX = POY (-1X2. 
so that p(v) = y'(—1)”. However, if || yi; ||is a real matrix of signature 


(+-—+— --- +-—), then p(y) is real (cf. Theorem (6.2)). 
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We now restate our rules for lowering and raising sets of skew-symmetric 
indices. They are 


- {t} 7 i 
XU = eww X”, 


[z|! 
(5.4) and 
\. {i} y (3) (i) 
astra il 
where 
(5.5) ' , {i} ue (gyn Storrs 


In (5.4) if a bar (—) is added to the X in the right member it is erased from the 
left member. We observe that {7} = {k}if|i|-+|&| =n. The operations of 
our calculus are summarized in the theorem* 


TuEorEM (5.1). If single tensor indices are lowered and raised by y;; and y"’ 
while sets of skew-symmetric indices are lowered and raised by the rules (5.4), the 
resulting calculus 1s consistent and gives the relations indicated in the diagram 








i € - 
x = ee X mn) 
(5. 6) Y Y 
Pen . yim 
X (i) ¢ . —~ Xe 


The rules regulating the manner in which sets of summed indices may be 
raised and lowered are computed to be 


(5.7) XY =XwmY®, XOVS = XHY®, 

and 

(5, _1 yw Lo AORe! doa: gg 

5.8) rayi~ Yo = Ey %o ¥-. . 


The numerical coefficients in equations (4.7) may be evaluated by computation 
with (5.4). The result of this computation for the first of these coefficients is 


. Contained in the 


cee, 


: If yij = —vii, & factor (—1)! *! must be added to the right member of (5.1). Theorem 
(3.1) will continue to hold if we put {7} = (—1)#! #!@-lé!+) jn (5.4). In order that | yi; | #0, 


at be even and hence {i} is always real. Equations (5.7) of course require modification 
ma —73;. 
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Turorem (5.2). If X® and Y” are skew-symmetric tensors, | i | is 
and =< |y |, and | j| is <n-—|zx|and <n —|y|, then 


IA 
= 


i)(r ri|s 8 ws 
{2 x00 Yinw a (—1)! Ils] { & yo bieyery 
(5.9) M 
wir?” biG YO Xo, 
where (i) = (i:)(t2), (7) = (j1) (je), and M is the smaller of the two numbers | y | 
and n — |x|. 

This useful identity depends only upon the rules for lowering and raising sets 
of skew-symmetric indices by means of the numerical tensors and does not 
involve the tensor yi. If |i; | is not available we suppose the factor y in 
(5.4) to be an arbitrary scalar of weight two. 

Special cases of (5.9) are stated in the following theorems. 


TurorrM (5.3). If X® and Y™ are skew-symmetric and | x | + | y | =n, then 


(5.10) wae xO Tino ai (- gyhetbettiet oe yoo Xe 


FT OF 


+ additional terms in which more than | s | indices are summed. 


Turorem (5.4). If V°"© Vira = 0 and | s | is odd, then VV" Vijpm = 0. 

To prove this we put X° = Y® = V™ in (5.9) and replace (i) and (j) by 
(v)r and (w)p, respectively. This theorem, for |s| = 1, was used in the proof 
of Theorem (3.4). : 


TuroreM (5.5). If X® and Y™ are skew-symmetric and |x| + |y|= %, 
then 








1 i(r) lz|]y| i(s) (x® Yo) i 

(5.11) Tr]! a Yim -t- (—1) ° ls]! TI Y j(s) = rae vy | 6;. 
This is the special case of (5.10) in which all but one of the indices of X are 
summed against indices of Y¢). The scalar factor in the right member of 


(5.11) is most readily obtained by taking the trace of the left member and 


* using (5.8). 


An application of our calculus to an elementary problem in projective geome- 
try is contained in the 


TuroreM (5.6). If X and Y™ are coérdinate tensors of linear spaces, X and 
Y, where X + Y = land XY = 0, then 


(5.12) Sf = 2X? Vig — (— itil Ey AT 


defines a collineation ¢' = Sip’ with the pointwise invariant spaces X and r, 
This collineation is of period two unless X is 0 or 1 and then it is the identity. 
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In proving this theorem it is convenient to abbreviate (5.12) to 


(5.18) S= X.Y — xY-X, 

and by a similar abbreviation to replace the identity (5.11) by 
(5.14) X-Y +AY-X = ul. 

Then S can be written in the forms 

(5.15) S = 2X-.Y — ul, 

and 

(5.16) S = pl — 2Y-X. 


The square of the collineation y* = Sj’ is the identity, for 
SS =(2X-Y — ywl)(ul — 2Y-X) 
= 2u(X-Y + rAY-X) — pl 
2 
= MK 1, 


where the “product”? —4 \(X-Y)(Y-X) vanishes since it contains the tensor 

YoY which is zero by Theorem (3.3). The collineation is non-singular 

since » = 0 would imply XY # 0, contrary to the hypothesis. A point y’ in 4 
the space Y satisfies the conditions Yj7py’ = 0, so that using (5.15) we have 


t j 2 i(r, a j 7 | 
jv) = (2x Yin — wil) v = — a | 


r|! 
Hence the points of Y are invariant. Similarly, using (5.16), the points of X are 


invariant. 


6. Linear Spaces on a Quadric. The tensors V“ and V3 have been inter- 
preted as codrdinate tensors of the same linear space. If we write 


Vv ais > ee A"™B* hoy H*', 
we observe that 


Vay = > + Ai,Ai, tee H;,,,. 





The hyperplanes A;, B;,---,H; are the polars, respectively, of the points 
A’,B’, ... , H' in the quadric ' 


(6.1) Q: yisX'X’ = 0. 
Hence Vi» is a covariant codrdinate tensor of the polar of the space V". Of 


(i) ‘ Utes : . 
‘ourse Vis a contravariant coordinate tensor of this same space. Since a 


linear space V lies on Q if and only if it is contained in its polar space, we can 
we Theorem (4.5) to get the 
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TuEoREM (6.1). A space with codrdinate tensor V lies on Q if and only if 
(6.2) VV cine = 0. 


Reference to the first part of Theorem (3.2) shows that (6.2) implies 2 | » | < n, 
so that the maximum number of linearly independent points in a space on Q is 
1(n — 1) when nis odd and $n when niseven. We now restrict our considera- 
tions to the latter case so that (6.1) defines a quadric in a projective space, 
P2,-1, of 2v — 1 dimensions. A linear subspace of P21 of vy — 1 dimensions 
will be called an “axis.” An axis then has a coérdinate tensor V“, where 


|¢| =». The polar of an axis is again an axis so that V“ will lie on the quadric 
if and only if it coincides with its polar space. That is, 
(6.3) V® = pv®. 


Lowering the indices (z) with 7.) and raising them again with «° gives 
V® = pV™ and consequently p = +1. 


THEOREM (6.2). A necessary and sufficient condition that a codrdinate tensor 


V‘( | 4 | =v) shall determine an axis on the quadric (6.1) is that either 
(6.4) Fy; V° = +V2, 

or 

(6.5) FO ee ¥. 


The axes on Q are thus separated into the two families F, and F_, character- 
ized by (6.4) and (6.5) respectively. The association of the plus sign with one 
family and the minus sign with the other is a matter of arbitrary choice. Indeed 
it is determined by the selection of one of the two values of the factor 7 occurring 
in (5.4). 

THEOREM (6.3). If V and W are two azes on Q, then 


(6.6) dimension of VW = v — 1 (mod 2) 
if V and W belong to the same family, and 
(6.7) dimension of VW = v — 2 (mod 2) 


if V and W belong to different families.® 
Since V and W are axes on Q, 
(6.8) VO ary VO, Wo = dWeW®, 4] =», 


and the spaces will belong to the same family if \yAw = +1 and to different 
families if \yAw = —1. Let us put D(VW) = 1 + dimension of VW. Then 
to prove the theorem it is sufficient to show that D(VW) > u implies D(VW) > 





6 A proof of this theorem based upon stereographic projection is given by Bertini, loc. 
cit., p. 148. 
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utlifry\w = +1 and »y — » = 0 (mod 2), orif \yAw = —landy — yp =1 
(mod 2). 

Assuming that D(VW) > » and applying Theorem (4.5), we have V2.2 Wi 
=(Qfor|s| 2»—. Hence if we put X = V™ and YS = W «) in equations 
(5.10) they reduce to 


69) VO°Wam = —(-1)"™ WL2V Gm, where |i| =» +1. 





On account of (6.8) these equations imply 
(6.10) err Wii nes (= 1)” ”rrAwW Vy (r) « 


We wish to prove that under certain conditions WV.) = 0, for this 
implies D(VW) > || =” +1. If thisis not the case, then by Theorem (4.5) 
W?Ving = T° Sy where I and J;;) are coérdinate tensors of VW and 
V + W, respectively. Since V + W is the polar of VW, J) = pyipwl and 
consequently W° V@,, = pI () is symmetric in the sets (7) and (j). Hence 
WOVD,. = WIV, = VO W,,. Substituting in (6.10) now gives 


(6.11) (1 + AvAw(—1) 9) VOW = 0, \ 


and therefore V° Wij) = Oif AyAw = +1 and vy — np = 0 (mod 2) or if 
\y\w = —landv — » = 1 (mod 2). 

TuzorEeM (6.4). If P21 is complex, or if it is real and the signature of Q is 
zero, then Q contains axes. If Ps, is real, Q contains axes only if its signature 
18 Zero. 

We omit the proof. 

TaroreM (6.5). If Q contains axes, then axes V and W on Q exist for which 
D(VW) = a, where ais any integer > Oand S »v. 

Under the hypothesis of the theorem we may choose coédrdinates in the real or 
complex P2,_; so that the equation of Q is 


(6.12) xix’? + x’*x’?? + ‘et + 6b dof =. 0, 


Let us call the vertices of the codrdinate 2y-point Hi, E2,--- , E,, E 1) Ba, ++ 
and Z,, where the coérdinates of E,(a = 1, 2,---,¥v) are 6 and of E, are 
bi». Then the spaces 


VE Ai+het+---+ Fat Fan t--- +2, 
(6.13) and 


/ 


W=R+hh+---+Eet+t Ban t+: +8, 
lie on Q and D(VW) = a. 
TukoreM (6.6). If the quadric, Q, in Ps, contains the distinct axes V and W, 
' > 2, and 6 is any integer = 0 and < v, then there exists an axis, A, on Q such 
that D(AV) =Bx D(AW). 
The theorem cannot be extended to include the case in which » = 2,8 = 1, 
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and D(VW) = 0 for then V and W would be non-intersecting lines on a quadric 
in P3 and any line on Q which intersects one of two such lines in a point also 
intersects the other in a point. In the other cases which occur when v = 1 or 2 
the theorem is trivially satisfied. 

Let us put D(VW) = a(< vr). Then it can be shown that by a suitable 
choice of coérdinate system we may take the equation of Q to be (6.12) while 
V and W are the faces of the codrdinate 2v-point given by (6.13). We now 
distinguish several cases: 

Casel. B<a. Take A= FE, + Fo +--+ Ep + Bon +--+ 2. 
Then D(AV) = B, and D(AW) = B+v—a>B8. 

Case 2. B 2aand28 #v+a. Take A asin Casel. Then D(AV) = 8, 
and D(AW) =a+v—6B ¥8. . 

Case3. 28=v+a. TakeA=Ei;+BE:+--- +E, g+Byput-- + 
E,. Then D(AV) = 8, and ifa +8 2», DAW) =a+8 —»v £8, orif 
a+ B<v, D(AW) =v — a — 8B, which is different from 6 unless 28 = » — a. 
In this event a = 0, and 26 = ». 

Case4. 28 =v>2,a=0. The3-space H,. + E,+ E;_. + E, intersects Q 
in a non-degenerate quadric which contains the non-intersecting lines E,, + 
E, and E}_,+ E.. Let F + G be any third line of the regulus containing these 
twolines. If weputA =A, +Ao+--- +Ept+ Eon t+--- +H tFH+G, 
then D(AV) = 8B = 4 vand D(AW) = B—2 £68. 

THEOREM (6.7). If two non-singular collineations of Py (vy > 2) leave Q 
invariant and effect the same transformation of the axes of Fs. , then they are identical. 

Distinct axes of F, are transformed by the collineations into distinct axes 
which again lie on Q. Since intersection properties are preserved, the trans- 
formed axes will all belong to the same family, which may be either F, or F_. 
The following proof applies to both these cases. 

By the preceding theorem an axis of F_ is unambiguously determined by the 
axes of F, which intersect it in a space of 8 — 1 dimensions, where f is any 
number (say v — 1) congruent to » — 1 (mod 2) and between 0 and » — 1. 
Hencé the two collineations must effect the same transformation of the axes of 
F_. Since every point on Q is the complete intersection of two suitably chosen 
axes on Q (of the same or different families), the two collineations transform 
the points of Q in the same way and are therefore identical throughout the space. 


7. Representation of Points of P2,; as Involutoric Collineations of P21. 
The non-singular quadratic form y;;X'X’ cannot be written as the square of a 


linear form in the 2y indeterminates X’, X’, ... , X” so long as the coefficients 
of the linear form are restricted to the field of complex numbers. The equation 
(7.1) (X'yi)” = (vsjX'X) 1 


is, however, an identity in X’ if the coefficients yi are suitably chosen matrices of 
order 2”. To show this we first introduce a codrdinate system in P2,-1 in which 


(7.2) yisX'X? = (XY? + (X*)? + + (PY. 
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We shall call a coérdinate system cartesian if y;;X'X’ has this form init. If we 
assume that y: behaves as a covariant vector under transformations of co- 
ordinates in P21, a solution of (7.1) in cartesian coédrdinates gives rise to a 
solution of (7.1) in arbitrary coérdinates. 

In a cartesian coérdinate system, equating coefficients in (7.1) gives 























(7.3) (y:)” = a ‘and Yi = - VN: (i ” j). 
For y = 1 matrices satisfying these equations are 
a 0 1 
(7.4) yi = and vs? = ‘ 
The recursion formulae 
(v—1) 
Ya 0 
(») 
% = i _4» | (a = 1,2,---,2(»— 1)), 
(7.5) 
(v) 0 t lot (») 0 lo»-1 
Y2a-1 = J ’ and 72 = \y 
my lor-1 0 lor-1 0 | 























then supply solutions of equations (7.3) for every value of ». 

The matrices y; = || vis || (@ = 1, 2, --- ,2v; A, B = 1, 2,---, 2”) forma 
basis for a linear family of involutoric collineations in P»_;. A codrdinate 
vector X* of a point in Ps,_; determines a matrix X = || X“, || by means of the 
equations 


(7.6) X = X'y:. 
Equally well, these equations determine a representation 
(7.7) pX*<>p || Xp || 


of the points of Ps,_; by involutoric collineations of P2»_; defined by the pencils 
p || X“, || of matrices. 


8. The Correspondence Between Tensor Sets and Matrices. In a cartesian 
codrdinate system the product of any number of distinct matrices of the set 
1) 72, +++ , Y2y has zero trace. This follows from the equation 


(8.1) Vo(VinVig ..* vi,)vr = — (ViVi tage Vi,)s 


which follows from (7.3) if v is equal to 7; when r is even and different from 
h,”,+++ , and 7, when 7 is odd. 
By using the commutation rules (7.3) and the relations 


(8.2) Trace (yy; --- ys) = 0, (i,j, --- , 8 all different) 


it can be shown that the matrices 


(8.3) l, Yi, ViVi (i < D; 9 5 Vi VieVizs °° * Vie, (1 < te Saees tay) 
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are linearly independent. Since there are 2” of them every matrix of order 2’ 
can be uniquely expressed as a linear sum of these matrices. That is, an arbi- 


trary matrix X = || X“, || of order 2’ is expressible in the form 
2Qy 
1 i 
(8.4) X = 2, ik X® 86, 


where X° = X'! 5°. = sep) and for different values of the indices (i) = 
itz +++ 4) 4), 8c is defined by the equations 


(8.5) Si) = ViVig ++ * Yuu 


It is understood that when |7| = 0, s) = 1. When X is a matrix of the linear 
family X‘y; the coefficients X“ in (8.4) vanish unless | 7| = 1 so that equations 
(8.4) include (7.6) as a special case. 

The quantity sq@ = || s¢:)s || was defined to be skew-symmetric in the indices 
(i). This property is preserved under transformations of coédrdinates. Indeed, 
we may define s,,) by the formula 
(8.6) 8 = 1 803) Vin Vie *** Yisrs 
which reduces to (8.5) in a cartesian coérdinate system. 

We can solve for the coefficients X“” in (8.4) in terms of the matrix X. Todo 
this we employ cartesian codrdinates to prove that Trace ss.) = Oif |¢| ¥|j| 
and otherwise it is a multiple of 4(;}. The exact relation 


(8.7) Trace (sy) = (—1)#*! CFI 9 alt] 52 


(7) 





t 
7 





may be verified in a cartesian coérdinate system and evidently remains valid 
under codrdinate transformations. Multiplying (8.4) by s“ and taking the 
trace therefore gives 


(8.8) XO om (—1) 1 S116) O° Trace X 8 
and we have the 

THEOREM (8.1). Equations (8.4) and (8.8) establish a (1-1). correspondence 
(8.9) = || X*n || <> {X, X‘4, X= —X*,..., XO = XO i] = 2y)} 
between matrices of order 2” and sets of skew-symmetric tensors.’ 

9. The Collineations Corresponding to Linear Spaces on the Quadric. The 


correspondence of this theorem allows us to represent a linear subspace of 
P2,_1 as @ collineation in the spin® space P2»_,. To-do this it is only necessary to 





7 The ordinary rule for the multiplication of two matrices gives rise to a rule for the 
“multiplication” of the corresponding tensor sets. A discussion of this multiplication of 
tensor sets is included in the paper by R. Brauer and H. Weyl, Spinors inn Dimensions, 
Am. Jour. Math., LV II (2), 1935, pp. 425-449. 

* The term “‘spin space” is appropriate for P2»_, because for » = 2 it is the space of the 
spin variables introduced by Dirac in his theory of the electron. 
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take the tensor set to be {0, 0, --- ,0, A“, 0, --- ,0} where A“ is a codrdinate 
tensor of the linear subspace. 

A lemma which we shall need in the proof of the next theorem is 

Lemma 2. If A and B are square matrices, A* = B® = 0, and AB = + BA, 
then 


(9.1) rank AB S } rank A. 


This is obtained from the general relation’ (holding between any three matrices 
A, B and C) 
rank AB + rank BC S rank B + rank ABC 


by putting C = A. 

TuzoreM (9.1). If L“ is a codrdinate tensor of a linear space L on the quadric 
and = nik L 8, then rank } =2”"'*!| and unless L is the null set * = 0. 

Let us take Ii, Le,---, LZ, 4; to be|7| linearly independent points on L. 
(If || = 0, L is the null set and\ = 1.) Then L = ly + Le Sis Ly ¢}. 
It is possible to find additional points Ly ;\41, --- , Z, such that A = L, + --- 
+ L, is an axis on the quadric. Then y,;L,L; = 0 for a, b, = 1, 2,---,», 
where L, is a coérdinate vector of the point La. 

Equating coefficients of the arbitrary variables X* in (7.1) gives 


(9.2) avg + yi) = isl. 
Multiplying both members by LiL} and summing gives 
(9.3) Nady = — Doda (a,b = 1,---,») 


where 4. = Liy;. 

In particular, (Aa) = 0, so that rank A, < } (order of \.) = 2”. The 
product of any number of the matrices \, also has square zero for (Agds -- - Ap)” = 
+ (Ne)"(u)” +++ (Ap)? = 0. Since L® = LIL... Livy = Made e+? yay 
and therefore \” = 0. 

Repeated application of the lemma now gives the chain of inequalities 


rank \y es ae 


rank (), Ae) < } rank \; in 


“IIA 


(9.4) » “ . 
rank (Arde +++ Ae) S drank (Ai Ae - + Apu). S 2” 


rank (Ay Me =++ dy) S frank (Arde +++ Awa) $2 = 1. 
However, if A® is a codrdinate tensor of A = Li; + Lz: + --- + L,, then 
a AM s5 = ike --- A, ¥ 0 and consequently rank (Ai\2 --- A,) = 1. Hence 
the inequality sign cannot hold at any step and rank (Aide -- + Ajai) = 2” 1! 
'C. C. MacDuffee, The Theory of Matrices, Berlin 1933, Theorem 8.3. 
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TuroreM (9.2). If A‘?(|i| = v) is the coérdinate tensor of an axis on the 


' 1 é 
quadric and a = —. A 8), then 
[7 | 


! 


(9.5) a = || a*s || = || ¥*ge|l. 


By Theorem (9.1) the rank of a is one and hence all the columns of the matrix 
are proportional to any non-vanishing column with elements yj’, y’, ..., y”. 
If the factors of proportionality are called ys, we have (9.5). Interpreting y“ 
as the codrdinates of a point in the spin space we have a representation 


(9.6) A-y 


of theaxes on the quadric in P2,_; by means of points in Py_;. In the following 
sections we shall study the figures in the spin space determined by linear spaces 
on the quadric and apply these results to the representation (9.6). 


10. Spaces in P,_, Determined by Spaces on the Quadric in P2,_,. The 


matrix B = a7 Bs: corresponding to a linear space B with codrdinate ten- 


sor B™ is, by Theorem (9.1), singular if B lies on the quadric. Hence the col- 
lineation determined by £6 transforms all the points of P2-_,; into the points 
of a subspace R,, the “rank” space of 8. A point y with codrdinates y“ 
will be contained in Rg if and only if there exists a codrdinate vector 6” such 
that y* = B*, 6”. 

The singular points of the collineation form a linear space Nz, the “null” 
space of 8. Thus a point y belongs to Nz if and only if By = 0. It is then 
evident that D(Rs) = rank 8 and D(Ngz) = nullity of 8, so that 


(10.1) D(Rs) + D(Ng) = 2’. 


In the following theorems of this and the next two sections, we shall suppose 
that B is a linear space on the quadric and that 8 is the matrix corresponding to 
B under (8.9). 


THEOREM (10.1)... Rg is a subspace of Ng. 

If y belongs to Rs, ¥ = 66. But 6 = 0, so that By = B’@ = 0 and hence 
also belongs to Ng. The space Rg will be a proper subspace of Ng if the rank of 8 
is less than the nullity of 8 and the spaces will coincide if the rank and nullity are 
equal. By Theorem (9.1) the latter case will arise only when B is a point and 
then Rg and Nz are coincident axes. Since Trace (X‘y;) = 0, the pointwise 


invariant spaces of a non-singular involution of the family X‘y; are of the same 


dimensionality. They are therefore a pair of non-intersecting axes in P2-1. 
The spaces Rg and Ng may be regarded as the coincident axes of the singular 
involution corresponding to a point on Q. 

In a cartesian coérdinate system the matrix 


(10.2) Yo = (—1)*yxy273 os ie 
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satisfies the equations 
(10.3) (yo)” =1 and yi = — yin. 


Expressing Yo in the form (8.4), we could easily show that these equations deter- 
mine Yo to within sign. It is evident that yo continues to satisfy (10.3) if 7; is 
replaced by tiy;, where | t; | # 0, and hence the involution defined by the pencil 
of matrices pyo is uniquely determined by the linear family X‘y;. We shall 
call it the “invariant involution.” 

The invariant nature of the matrix yo is put into evidence if we define 


—1)" | vi {7 
(10.4) yo = ( on & Vey Ves ** Vass 





which we may do since the right member reduces in a cartesian coérdinate 


"system to (10.2). The advantage of (10.4) is that it determines a particular one 


of the two matrices satisfying (10.3)—at least to within a choice of one of the 
square roots of the determinant | ¥;; |. 

If this root is the same as that used in the definition of Xz) in terms of X? 
(cf. (5.4)), yo satisfies the equations 


(10.5) vos = (—1)**1*P 85, 
and 
1 i v—| i j 
(10.6) Yo (+ x sa) = (—1)* siias (+ Xs), 
|| lg\! 


These equations are most easily verified in a cartesian codrdinate system and 
their tensor form then insures their validity in all projective coérdinate sys- 
tems in P2,;. The corresponding formulas for multiplication on the right by 
y are obtained from these by using the invariant relations 


(10.7) 708) = (—1)!"! sayyo. 


THEorEM (10.2). The invariant involution leaves Rg and Ng each invariant. 
Ee 0: . 
Since B = aq B‘ Ba, where B“ is a codrdinate tensor of the space B on the 
quadric, we can use (10.7) to get 708 = (- 1)'*'Byo. If pis a point of Rg so that 
¥ = B'0°, then y“ony” = B4s[(—1)!yo.~°] and hence the involution transforms 


¥ into a point which again belongs to Rs. The invariance of Ng follows 
similarly, 


ll, The Invariant Polarity. Using the numerical values of the matrices 7; 
which we gave in (7.4) and (7.5), it is easily proved that the matrix 


(11.1) C= ViV3V5 °° * Y2v—1 


satisfies the equations 


(11.2) c’ _ eC, om (—1)""™ 
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and 
(11.3) (Cyo)’ = fCy,  (Cyi)’ = f(Cri), f= (-D)e. 


These equations will remain valid in all projective coérdinate systems in P,,_, 
if we take C to be a matrix of the type” C = || Cuas||. We assume that C 
behaves as a scalar under transformations of codrdinates in P»,_; so that equa- 
tions (11.3) remain valid after a transformation affecting the index 7. 

The equations 


(11.4) ga = Casy” 


define a point — hyperplane transformation of the spin space which is a polarity 
in a quadric or in a linear complex, depending on the value of v. Equations 
(11.3) imply that this polarity is geometrically commutative with the invariant 
involution and with every involution of the linear family X*y;. We shall call 
it the “invariant polarity.” 
THEOREM (11.1). The invariant polarity interchanges Rg and Ng. - 

Let us put D(B) = r and take Bi, Bi, --- , Bi to be codrdinate vectors of r 
linearly independent points in B. Multiplying the second of equations (11.3) 
through by Bi, (p = 1, 2, --- ,r) and putting B, = Biy;, we get B,C’ = fCB,, 
or, using (11.2) 


(11.5) CB, = + B,C. 


Since || Cuz || is non-singular, D (polar of Rg) = 2” — D(Rg) and combining 
this equation with (10.1) gives D (polar of Rg) = D(Ng). Hence to show that 
the invariant polarity transforms Rg into Ng it will be sufficient to prove that the 
polar of Rs contains Ng. This will be the case if the polar hyperplane of an 
arbitrary point y of Rg contains every point of Ng. We shall therefore have the 
theorem if we can prove that o*Ciny” = 0 whenever y” = 8’ c6° and 
Bue = 0. 

Calling || ¥~ ||, || ¢* || and || 6* || matrices of one column, the condition is that 
¢’'Cy = Owhenevery = B@and Be = 0. Butsince B = fife --- B, we can employ 

(11.5) to get 


o’Cy = ¢'CBO = y'Chip: --- BO = + (y'BiBs --- B,)CO. 


The points B, are conjugate by pairs in the quadric and consequently the 
matrices 8, anticommute by pairs. Hence 


(v'BiB2 --+ B;) = (8, «+ Bebry)’ = + (BiB --- Bw)’ = + (By)' = 0 


and substituting in the last equation gives the required condition, g’Cy = 0. 


e 


12. Properties of Rs and Nz. The matrix yo satisfies the equation a —1=0, 


and consequently its roots are +1 and —1 and its elementary divisors are 





10 Tt can be shown that equations (11.3) possess a solution only for the value of f given 
and that they determine the matrix C to within a factor. 
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simple. Since trace yo = 0, the sum of the roots is zero. A suitable choice of 
coordinate system in the spin space will therefore enable us to take yp in the form 


bs) -@ 


12.1 yo = a 
(12.1) ae ee 














The pointwise invariant spaces of the involution defined by yo are spaces of 
2’! —1 dimensions which we denote by [yo]* and [yo]. The points of these 
axes are characterized by the equations 


[vo]”: vosy” = + y%, 


(12.2) and 
[yo]: Yoav” = Tt v, 


respectively. The representation (9.6) images axes on Q by points of [yo]* and 
fy] in the way stated in the 


TueoreM (12.1). The equations 


(12.3) 5 A® shy = ¥4Coo ¥° 

and their inverses | i 
(12.4) A® = (—1)'Cacs “wy”, 

establish a (1-1) correspondence u 

(12.5) Ay 


between the axes on the quadric in Ps, and points in P21. Under this cor- 
respondence axes belonging to F,. correspond to points of the axis [yo] and azes 
belonging to F_ correspond to points belonging to the axis [yo]. 

If A is an axis on the quadric in P»,_1, its corresponding matrix is, by Theorem 
(9.2), of the form a = || ¥“ys || where Rg is the point with coordinates y“ and 
Ns is the hyperplane with codrdinates gp. By Theorem (11.1) the vector ¢gz is 
determined as a multiple of Cscw° and therefore the point y determines the 
matrix a to within a factor. But the matrix a determines a codrdinate tensor of 
A by the equations (8.8) and hence if y corresponds to any axis under (9.6) 
it uniquely determines the axis. 

To prove the last part of the theorem we apply (10.6) to the matrix a = 





1 yw ; 
4 8) to get you = + a where the plus sign is, used if A = + A“ and 


the minus sign if A® = —A®. Hence yésy? = + ¥4 if A belongs to F,, 
and yony” = — y4 if A belongs to F_. Theorem (10.2) could have been 
used to prove that y lies in either [yo]* or [yo], but the argument just given 
Proves in addition that points corresponding to axes of the same family lie 


in the same axis of yp and points corresponding to axes of different families lie 
In different axes of Yo. 
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The invariance of Rg under yo implies that Rg intersects [yo]’ and [yo) in 
spaces Rglyo]* and Reg[yo] such that 


(12.6) D(Relvo]’) + D(Relvo) ) = D(Rz). 


If B is not an axis, the statement that Rg intersects [yo]" and [yo]~ in spaces of 
the same dimensionality is contained in the 


THEOREM (12.2). Jf D(Rg) > 1, then 
(12.7) D(Relvo]") = D(Relvo]) = 4D(Re). 


If B is not an axis, its polar space does not lie entirely on the quadric and hence 
we may choose a point in it in such a way that its corresponding collineationT 
is non-singular. Of course ! commutes (or anticommutes) with the collineation 
8 corresponding to B. If y = 86 is any point of Rs, Ty = Teé = + B(T) and 
hence I leaves Rg invariant. Since I is an involution of the family X‘y;, it 
anticommutes with yo and hence interchanges [yo]’ and [yo]. Therefore I 
interchanges Rg[yo]’ and Rglyo] and the spaces are of the same dimensionality. 

An (r + 1)-dimensional linear space B; on the quadric is determined as the 
join of an r-dimensional subspace B and a point LZ; in B,; but not in B. The 
spaces corresponding to B and L, determine the spaces corresponding to B, 
in the way stated in the 


THEOREM (12.3). Jf B, = B + I, is a linear space on the quadric, Ly is a 
point, BL, = 0, and B,, B and L, correspond to matrices B,, B and ), respectively, 
then 


(12.8) Re, = RR), and Na, = Nz + N),- 


The second of these equations is obtained from the first by taking polars with 
respect to the invariant polarity. _ Moreover, since 6; = B\1= + 2,8 it is evident 
that Rg, is included in both Rg and Ry, and therefore in RgR,,. Hence it is 
sufficient to prove that D(RsR,,) = D(Rz,). 

Let us choose Lz to be a point on the intersection of the quadric Q with the 
polar of B. This can always be done in such a way that Le is not conjugate to 
I, and then Bz = B + Ly will lie on Q and the line L; + Le will not lie on Q. 
We now take M, and Mz to be two distinct points on L; + Le which are conjugate 
with respect to Q and call their corresponding matrices yw; and wz. Then “= 
a1 + bd: and we = cdr + dds. Since both d, and A» commute with 6 so do 
M; and ue, and hence the non-singular involutions with matrices y: and pe trans- 
form Rg into itself. The same is therefore true of their product, ime. Making 
use of the relation wie = — pei, a computation shows that the pointwise 
invariant spaces of the involution determined by pie are the rank spaces of the 
singular elements, \1 and zs, of the pencil pd: + ore. Hence D(RgRa,) + 
D(RsR,,) = D(Rz). The involution with matrix p; interchanges Ry, and Ry, 
and leaves Rg invariant so that D(RsR,,) = D(RsR,,). Therefore D(RsRa,) = 
3 D(Rs), which is equal to D(Rs,) by Theorem (9.1). 
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13. Gepmetry of a Generalization of the Pliicker-Klein Correspondence. 
The properties of the geometrical correspondence 


(13.1) B- Rs 


between linear spaces on Q and the spaces into which the corresponding matrices 
transform the whole of P2»_; will be made clearer if we discuss some special cases. 

When v = 1 the quadric in P; consists of two distinct points and these points 
correspond, respectively, to two distinct points in the spin space, which is 
again a P;. 

When vy = 2 the spaces P2,_; and P2»_; are again of the same number of dimen- 
sions but the axes on the quadric are now represented by points of two non- 
intersecting lines J, and in the spin space Ps. Two lines on the quadric which 
intersect in a point Py correspond to two points L; and Lz lying on the lines 
|, and iz respectively, and the point Po corresponds to the line L; + Le crossing 
l, and J. 

When v = 3 the quadric is in P; and the spin space is 7-dimensional. The 
axes [yo] and [yo] are non-intersecting 3-spaces. A point on Q corresponds to a 
P; in P; which intersects each-of the 3-spaces [yo]* and [yo] in a line. Two 
points B; and Bz on Q therefore determine a pair of lines in [yo]* and a pair in 
lv] . These two pairs of lines may intersect in points LZ; and L and if they do 
the line B; + Be lies on Q and corresponds to the line L; + Ls crossing [yo]* and 
(vo] . If the plane A = B, + By + B; determined by the three points B,, Be 
and B; lies on Q and belongs to F,, then the points determine in [yo] the edges 
of a non-degenerate triangle while in [yo]* they determine three lines through the 
point corresponding to A under (13.1). 

For vy = 1, 2, or 3 it can be shown that all the points of [yo]* and [yo] occur as 
images of axeson Q. When »v > 8 this is no longer the case, as follows from the 
fact that the dimension of [yo]* is 2”-" — 1 while the dimension” of the family of 
axes F* is 4 »(v — 1). 

In a coérdinate system in which yo is given by (12.1), equations (11.2) and 
(11.3) give yoC = (—1)’Cyo and hence C is of the form 














Ci 0 
(13.2) C= , ; Ci =@é Ci, C: =@é C2, 
0 C, 
when » is even, and of the form 
0 ¢ 
(13.3) t's a ee ee 
C, 0 














when v is odd. The invariant polarity therefore interchanges the axes [y0]* 
and [yo if vis even and leaves them separately invariant if v is odd. 


When y is even a point of [yo]* is transformed into a hyperplane which contains 
eee eapisneinntis 


" Bertini, loc. cit., p. 142. 
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The invariance of Rg under yo implies that Rs intersects [yo]* and [y) in 
spaces Rglyo]’ and Rg[yo]_ such that 


(12.6) D(Relyo]") + D(Relvo] ) = D(Rp). 


If B is not an axis, the statement that Rg intersects [yo]* and [yo] in spaces of 
the same dimensionality is contained in the 


THEOREM (12.2). If D(Rg) > 1, then 
(12.7) D(Rslyo]") = D(Relvo] ) = 3D(Rep). 


If B is not an axis, its polar space does not lie entirely on the quadric and hence 
we may choose a point in it in such a way that its corresponding collineationT 
is non-singular. Of course ! commutes (or anticommutes) with the collineation 
8 corresponding to B. If » = 80 is any point of Rg, Ty = Tp¢ = + B(T8) and 
hence I leaves Rg invariant. Since I is an involution of the family X‘y,, it 
anticommutes with yo and hence interchanges [yo]* and [yo]. Therefore 
interchanges Rg[yo]’ and Reg[yo]_ and the spaces are of the same dimensionality. 

An (r + 1)-dimensional linear space B; on the quadric is determined as the 
join of an r-dimensional subspace B and a point L; in B,; but not in B. The 
spaces corresponding to B and L, determine the spaces corresponding to B, 
in the way stated in the 


THEoREM (12.3). Jf B, = B + Ly, is a linear space on the quadric, L, is a 
point, BL; = 0, and B,, B and L, correspond to matrices B,, B and , respectively, 
then 


(12.8) Ra, = Rsk), and Na, = Nz + N),- 


The second of these equations is obtained from the first by taking polars with 
respect to the invariant polarity. . Moreover, since 6; = BA = + jf it is evident 
that Rg, is included in both Rg and R,, and therefore in RgR,,. Hence it is 
sufficient to prove that D(RsR,,) = D(Rz,). 

Let us choose Lz to be a point on the intersection of the quadric Q with the 
polar of B. This can always be done in such a way that Le is not conjugate to 
L, and then Bz = B + Lz will lie on Q and the line L; + Lz will not lie on Q. 
We now take M, and M¢ to be two distinct points on L; + Le which are conjugate 
with respect to Q and call their corresponding matrices py; and we. Then “1 = 
a, + bre and pe = cdi + dre. Since both A; and A» commute with 8 so do 
Mi and ye, and hence the non-singular involutions with matrices pw; and pe trans- 
form Rg into itself. The same is therefore true of their product, ume. Making 
use of the relation wiwe = — poi, a computation shows that the pointwise 
invariant spaces of the involution determined by pyu2 are the rank spaces of the 
singular elements, \: and ds, of the pencil py + ore. Hence D(RsRa,) + 
D(RsRy,) = D(Rz). The involution with matrix 4; interchanges Ry, and Ra, 
and leaves Rg invariant so that D(RgR,) = D(RsR),). Therefore D(Rsh,) = 
3 D(Rg), which is equal to D(Rs,) by Theorem (9.1). 











arr 3 o 
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13. Gepmetry of a Generalization of the Pliicker-Klein Correspondence. 
The properties of the geometrical correspondence 


(13.1) B-— Rs 


between linear spaces on Q and the spaces into which the corresponding matrices 
transform the whole of P2»_; will be made clearer if we discuss some special cases. 

When v = 1 the quadric in P, consists of two distinct points and these points 
correspond, respectively, to two distinct points in the spin space, which is 
again a Py. 

When v = 2 the spaces P2,_; and P2»_; are again of the same number of dimen- 
sions but the axes on the quadric are now represented by points of two non- 
intersecting lines J, and ,in the spin space P;. Two lines on the quadric which 
intersect in a point Py correspond to two points L; and Lz lying on the lines 
|, and I, respectively, and the point Po corresponds to the line L; + Le crossing 
l, and [,. 

When v = 3 the quadric is in P; and the spin space is 7-dimensional. The 
axes [yo]’ and [yo] _ are non-intersecting 3-spaces. A point on Q corresponds to a 
P; in P; which intersects each-of the 3-spaces [yo]* and [yo] in a line. Two 
points B; and B, on Q therefore determine a pair of lines in [yo]* and a pair in 
lyo] . These two pairs of lines may intersect in points L,; and Lz and if they do 
the line B, + Bs lies on Q and corresponds to the line L; + Le crossing [yo]* and 
lyvo] . If the plane A = B, + Bz + B; determined by the three points B,, B: 
and B; lies on Q and belongs to F,, then the points determine in [yo] the edges 
of a non-degenerate triangle while in [yo] they determine three lines through the 
point corresponding to A under (13.1). 

For » = 1, 2, or 3 it can be shown that all the points of [yo]* and [yo] occur as 
images of axeson @. When v > 8 this is no longer the case, as follows from the 
fact that the dimension of [yo]* is 2”.’ — 1 while the dimension” of the family of 
axes F* is 4 »(y — 1). 

In a coérdinate system in which 7 is given by (12.1), equations (11.2) and 
(11.3) give yoC = (—1)’Cyo and hence C is of the form 














C, 0 
(13.2) C = 7 i Ci = € Ci, C: = ¢ C2, 
0 C; 
when v is even, and of the form 
0 C 
(13.3) ee a. ee Fe, 
C, 0 














when » is odd. The invariant polarity therefore interchanges the axes [yo]* 
and [yo] if vis even and leaves them separately invariant if v is odd. 


When v is even a point of [yo]* is transformed into a hyperplane which contains 
eee 


" Bertini, loc. cit., p. 142. 
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fyo]” and intersects [yo]* in a space of 2” — 2 dimensions. The pojar trans- 
formation of P2-_; induces a polarity within the space lyol* and, similarly, within 
[yo]. 

When » is odd, a point of [y0]* is transformed into a hyperplane which contains 
[yo]. and intersects [yo]* in a space of 2” " — 2 dimensions. The invariant 
polarity therefore defines a mapping of the subspaces of [yo]’ into subspaces of 
[yo] _in which points correspond to spaces of 2”-* — 2 dimensions, lines correspond 
to spaces of 2” * — 3 dimensions, and so on. In particular, if R. is an axis in 
P.»_, corresponding to a point on Q, the mapping carries R.[yo]" into Raly). 
Since R, = Ralyol” + Ralyo] , the spoce Ra is determined by its intersection 
with [yo]. Similarly, a point of [yo] is determined by the (2”"* — 2)-dimen- 
sional space in which its polar hyperplane intersects [yo]". These results can be 
combined with the theorems of §12 to give the 


THEOREM (13.1). If vis odd and 2 3, the points of a quadric Q in P2,_; may 
be made to correspond to axes in Po-1 — 1 in such a way that the points of an 
(r — 1)-dimensional space, r < v, on Q correspond to axes all of which contain the 
same (2”-*" — 1)-dimensional space. Under the correspondence the points of an 
axis on Q either correspond to azes all of which contain the same point or to azes all 
of which lie in the same hyperplane. 

For v = 3 this theorem gives the Pliicker-Klein correspondence between 
points on a quadric in P; and lines in P;. In this case all the lines in P; enter 
into the correspondence. Lines on the quadric correspond to pencils of lines in 
P; and the points of a plane on the quadric correspond to all the lines through a 
point or to all the lines in a plane. 


14. Collineation Representation of H2, for vy > 2. The geometrical corre- 
spondences established in this chapter give rise to representations of the proper 
orthogonal group in 2y variables, Hz,. In this and the next two sections we 
shall discuss these representations. 


A collineation of the spin space, with matrix P = ||’ P*, ||, will leave invariant 
the linear family of involutions X‘y; if 
(14.1) P(X‘y,)P"* = Y"y, 


for arbitrary values of the parameters X‘ of the family. From (9.2), trace 
yy; = 2’6;. Hence, multiplying both members of (14.1) by y‘ and taking the 
trace gives 


(14.2) Y* = Ljx?, 
where 
(14.3) L} = 2°” Trace (y‘Py;P™’). 


The collineation with matrix P therefore transforms the matrices 7; into 
(14.4) n= Py,P™ - ysLi, 


and induces in P2,_; the collineation (14.2). 
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The quadric is left invariant by (14.2) since it is the locus of singular involu- 
tions of the linear family. Indeed, in a cartesian codrdinate system we have 


4.8) D(X) 1 = (X40)? = P“(X'g)? P = PUY.) P = DY), 


and consequently the linear transformation (14. 2) leaves the quadratic form 
vii ‘¥/ invariant as well as the quadric y;;X'X’ = 0. Hence 


(14.6) YrsLi Lj = Yii, 
and taking determinants : 
(14.7) |L}|= +1. 


A matrix || Lj || satisfying (14.6) will be called proper orthogonal if its deter- 
minant is + 1 and improper orthogonal if its determinant is — 1. The set of all 
orthogonal matrices form the orthogonal group H:, and the proper orthogonal 
matrices constitute an invariant subgroup H3,. 

If || L} || satisfies (14.6) and the codrdinate system is cartesian so that 7; 
satisfy (7.3), then the matrices n; = y,L; also satisfy (7.3). A successive special- 
ization of the codrdinate system will suffice to show that any ordered set of 2v 
matrices of order 2” satisfying (7.3) can be simultaneously transformed by a 
similarity into the canonical forms given by (7.4) and (7.5). Hence if || L} || 
is any orthogonal matrix there exists a matrix” P = || P*, || such that 
Li = PyP™. 

Moreover, if Ry;R™’ = Py;P™, then PR commutes with the matrices 7; 
and consequently with ali the matrices (8.3). Since these form a basis for the 
full matrix algebra, PR = pl and R= pP. The orthogonal matrix || L; || 
therefore determines P to within a factor by means of equations (14.4). This 
completes the proof of the 


THEOREM (14.1). Equations (14.4) (or (14.3)) establish a (1-1) isomorphism 
(14.8) || L5 Il <> e || P*s || 
between the group of orthogonal matrices of order 2v and the group of collineations of 


Py_s which leave the linear family X*y; invariant. 
Using (14.4) and the definition of yo, a computation gives 


(14.9) PyP™ = | Li | yo 


and hence if P corresponds to a proper orthogonal matrix it commutes with Yo 
while if it corresponds to an improper orthogonal matrix it anticommutes with 
»- In the coérdinate system of (12.1), 


(14.10) P= i” 





er 


a nad 





“This also follows from the theorem that the full matrix algebra allows only inner 
automorphisms. For the details of this proof see the paper of Brauer and Wey] cited above. 
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if the collineation corresponds to a transformation of Hz,, and 
0 Pi 


14.11 P= 
‘ Pz, 0 














if the collineation corresponds to an improper transformation of Hp,. 

It is evident that collineations of the type (14.10) leave [yo]* and [y]~ sepa- 
rately invariant while collineations of the form (14.11) interchange them. 
Using the (1-1) correspondence of Theorem (12.1) between axes on the quadric 
and points in [yo]* and [yo] we get the 


THEOREM (14.2). Collineations in P21 defined by proper orthogonal matrices 
(| Li | = + 1) leave the two families of axes on the quadric separately invariant 
while collineations defined by improper orthogonal matrices interchange them. 

If all the matrices of a group are of the form (14.10), elementary arguments 
suffice to show that the matrices P,. form a group which is isomorphic (perhaps 
multiply) with the original group. Combining this isomorphism with (14.8) 
we get the isomorphism 


(14.12) || Lj || > pPs 


between Hz, and a collineation group in Ps»-1_;. Moreover, two different 
matrices || L} || and || Mj} || which correspond to the same pencil pP.., induce 
the same permutation of points in [yo]" and hence the transformations of P2,-; 
which they define effect the same permutation of the axes of F,. If» > 2, 
Theorem (6.7), states that the two transformations must be identical. This 
will be the case only if || Lj || and || Mj || are proportional. Since both the 











matrices are orthogonal, || Mj || = + || L} ||. 
The matrix yo anticommutes with all the matrices y; so that yoyo = — 7: 
Comparing this with (14.4) we see that under (14.8) 
F 1 O 
(14.13) ll — 3; || <> eyo = p mM 
Hence if || Lj || corresponds to pP.,. under (7.8), || — L} || also corresponds to the 


same pencil of matrices. The isomorphism inverse to (7.8) is therefore (1-2) 
and we have the 


TuroreM (14.3). If v > 2, the matrix group H3, is (2-1) isomorphic with a 
collineation group in Po»-1_,. 

The theorem cannot be extended to the case vy = 2 since the proper quadric 
group in three dimensional space is a six parameter group while the entire 
collineation group on the line involves only three parameters.” When v = 3 





13 See, however, Geometry of Two-Component Spinors by O. Veblen, (Proc. Nat. Acad., 


‘19, 1933, pp. 462-474) where it is proved that the proper collineation group of a real non- 


ruled quadric in 3-space is (1-1) isomorphic with the collineation group on the complex 
projective line. The geometric correspondence underlying this isomorphism is the repte- 
sentation of the points of the complex projective line by means of the points of a sphere in 
real 3-space. 
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the theorem gives the group isomorphism associated with the Pliicker-Klein 
representation of the lines of 3-space by points of a quadric in Ps. 


15. Matrix Representation of Hz, for v > 2. Equations (14.4) and (14.9) 
may be combined into the equations 


(15.1) Py. a vel’, (a, B = 0, 1, "re 2 2v) 
¢ we define L? = Li = Oand Lp = 1. Equations (14.3) combine to give 
(15.2) (Cyp)’ = f(Crs), (8 = 0,1, --- , 2v) 


and multiplying through by L’. and summing on £ gives 
(CPyaP')’ = fC PyeP™, or ((P’CP)ya)’ = f(P'CP) ya. 
Hence 
C"(P'CP)ya = fC'((P'CP) ya)’ = f((P’CP)y.C"")’ 
= ((P’ CP)C ya)’ = yaC"'(P’CP), 


where we have used the relation C’ = e C. 

The matric C~’(P’CP) therefore commutes with all the matrices y; and is 
consequently a multiple of the identity matrix. That is, P’/CP = pC and the 
collineation with matrix P transforms the invariant polarity into itself. We 
may therefore normalize the matrices P entering into the isomorphism (14.8) 
by the requirement that 


(15.3) P’'CP = C. 


This normalization selects out of each pencil pP a pair of matrices which differ 
only in sign. 


The collineation representation (14.8) now becomes the (1-2) matrix rep- 
resentation 


P, 0O 
Pus 


of proper orthogonal matrices of order 2v by matrices of order 2’. We may 
again conclude that the matrices + P, form a group isomorphic (perhaps 
multiply) to the group of matrices + P, and so obtain the isomorphism 


(15.5) || Lj || > +& Py. 


(15.4) |Li|loarP=+ 














We shall refer to this correspondence between proper orthogonal matrices of 
order 2y and matrices of order 2”~' as the representation A... 

We shall now determine the nature of the correspondence inverse to (15.5) 
for values of v greater than two. Since the isomorphism of Theorem (14.3) is 
(2-1) the orthogonal matrices corresponding to a given pair of spin matrices 
+P, are at most the two matrices L = || Lj || and —L. Moreover, if +P, 
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corresponds to both —Z and +L, then +(P,)(P;) = +1,,_1 corresponds to 


—L(L) = — || 6; ||. Conversely, if — || 6} || corresponds to +1,,_,, then 
both L and —L correspond to the same pair of.spin matrices +P... 
By (14.13) the orthogonal matrix — || 6; || corresponds to two matrices (of 


order 2”) out of the pencil pyo. To determine the values of p for which the 
matrix pyo satisfies the normalizing condition, we make use of equations (11.2) 
and (11.3) to get 


(15.6) [((—1)"vel'Cl(—1)"vo] = C. 
Hence we have for » odd, 
(15.7) || — 85 || > Fly; 


and for v even, 
(15.8) | — 6; || > +1,,4. 


The properties of the representation A; therefore depend essentially upon 
whether » is odd or even. 


TuEoreEM (15.1). Ifv > 27s odd, A, tsa (1-2) representation in which 
|| Lj || P+, 


(15.9) and 
— || Lj || +iPy. 


The group H?, contains with each matrix || L; || its negative and no other matrices 
proportional to it; the group G of matrices of order 2”* contains with any matriz 
P.. also the matrices —P.,, iP. and —iP, and no other matrices proportional to 
P,. Itis not possible to sharpen A, to a (1-1) representation. The group G does 
not contain a proper subgroup in which there occurs at least one out of each set of 
four matrices +P, +iP,. 


In order to sharpen A, to a (1-1) representation it would be necessary to 
select a single matrix of order 2”* to correspond to each of the proper orthogonal 
transformations ; 


(15.10) Y'=—xX', Y*=-xX* Yia xX, Yeu X..., Y= 2’, 
and 
(“ip Yee wee Fue Fak... Y=’. 


In a cartesian codrdinate system these transformations correspond to collinea- 
tions defined by pyry2 and pyrys._ A (1-1) representation would therefore include 
two partial matrices, say (ayryz), and (byrys);, out of these pencils. These 
partial matrices anticommute while (15.10) and (15.11) commute, so that the 
(1-1) representation would break down for their product. 

Any subgroup of G containing at least one out of every set of four proportional 
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matrices would contain the matrices (ayry2)+ and (byrys).. for some choice of the 
scalars a and b. Hence the subgroup would contain the matrix (ayry2). 
(byivs)+- (ayvy2)+(byrys)+, Which is equal to —1. If the subgroup contained 
three matrices out of any one pencil, it would contain all four matrices of the 
pencil and therefore all the elements of G. 

The only remaining possibility is for the subgroup to contain one of the two 
pairs +P, and +7P, out of each set of four proportional matrices. Such a 
selection of matrices out of G would lead to the determination of a subgroup of 
Hj, which would contain one out of every pair of orthogonal matrices + || L} ||. 
Such a subgroup cannot exist for it would necessarily contain either 


(15.12) — —— — —— — — 














or its negative and also the square of the matrix chosen, which is in both cases — 1. 

This proof shows, incidentally, that H3, is not the direct product of the two- 
element group consisting of 1 and —1 and a proper subgroup of itself. We 
may also conclude from Theorem (15.1) that no identification of the matrices 
|| Lj || and — || Zi || can lead to a (2-2) isomorphism of the sort given in the 
following theorem. 


THEorEM (15.2). Ifv > 2 is even, A, is a (2-2) representation in which 
(15.13) + || Lj || ao +P,. 


Itis not possible to sharpen A, to a (2-1), (1-2) or (1-1) representation. 

The first part of the theorem follows immediately from (15.8) and the pre- 
ceding discussion. A (2-1) representation cannot be obtained from A, since 
the group of matrices of order 2” contains elements which anticommute. 
A (1-2) representation cannot exist since it would imply that Hz, is the direct 
product of the two element group and a subgroup of itself. Finally, a (1-1) 
lepresentation cannot be obtained from A, for the commutative transformations 
(15.10) and (15.11) correspond to anticommuting matrices. 


16. Matrix Representation of Hi and Hi. We complete the discussion of 
A. by treating the cases y = 1 and vy = 2 in the two following theorems. 
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TuroreM (16.1). If the quadratic form in E2 is taken to be X'X’, the proper 
“orthogonal” matrices are of the form 






































a 
(16.1) L; || = | 
I| Z; || Site 
and the representation A. is the (1-2) correspondence 
(16.2) || Lj || > + Va. 
To prove this theorem we observe that when vy = 1, C = | so that 
—c 
, -_ Pp ‘ 
(15.3) is the condition | P| = +1. Hence P = Taking X‘y; = 
0 1/p 
0 xX’ a aii 
x , we have (X'y;)) = X Xl. The equations of the isomorphism, 
P(X'y)P™ = Y’y;, are then 
(16.3) Y' = p'X' and Y? =X? 
* Pp 
: a p 0 i 
Hence | LZ; || = =: ,|| and p=+ Va. 
0 l/a 10 1/p 




















TuEorEM (16.2). A proper orthogonal matrix || L} || of order four corresponds 
under (15.4) to a patr of matrices 


A 0 
(16.4) +P=+ 
B 
where| A| =|B|=1. Conversely, if A and B are any two unimodular matrices 


of order two, then.the matrices (16.4) correspond to a proper orthogonal matrix 
under (15.4). The representation A, is (2-0). 

When » = 2, the matrix C is skew-symmetric and since it is of the form (13.2) 
we must have 


(16.5) Lik palm till al 














| — C2 0 


so that the matrices (16.4) satisfy the normalizing condition if and only if 
|A|=|B|=1. 
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It is a theorem of 3-dimensional projective geometry that a collineation may 


per sage oe a 
be constructed which simultaneously effects arbitrarily given collineations on 
the two reguli of lines on a quadric. Since we have represented the lines on Q 
by points of [yo]’ and [yo] , the matrices A and B describe the collineations on 
the two reguli and may be taken to be any two unimodular matrices. We could 
also show this directly by proving that every matrix of the form (16.4) trans- 
forms the linear family of collineations into itself. 
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INTRODUCTION 


The critical points of a real single-valued differentiable function f(z) of n 
real variables (x) are the points at which all of the partial derivatives of f 
vanish. Critical points present themselves in many different ways. If V(z, y, z) 
is a Newtonian potential function the critical points of V are the points where 
the gravitational force is null. In such examples critical points appear as 
points of equilibrium. They also appear in problems involving the equilibria 
of floating bodies, and more generally in any problem in dynamics where there 
is a generalization of potential. 

The following theorem in geography is an old and interesting example of 
critical point theory. Suppose the surface of the earth is such that the level 
points are finite in number. Let the points of minimum and maximum distance 
from the center of the earth be respectively po and pe in number. Let the 
number of passes or saddle points be p,. If these saddle points are of the 
“general”? non-degenerate type, then 


(0.1) Po — Pi t+ po = 2. 


This example will suggest the following questions. Is (0.1) the only relation 
between the numbers p;.involved? What happens in n-dimensional spaces? 
What becomes of the theorem when the number of critical points is infinite? 
And still more generally when one turns to the critical points of functionals, 
for example to extremals in boundary problems in the calculus of variations, 
what becomes of the classification into types of critical points? 

As in the study of functions in general, the theory depends in its superficial 
form upon the class of functions presented. These functions may be analytic, 
finitely differentiable, continuous, or even semi-continuous. The answer to the 
first of the preceding questions may be given by presenting a general theorem 
in the so-called non-degenerate case.’ 

Suppose that f is a single-valued function of a point p on a general n-dimen- 
sional manifold R with local coordinates (x). Cf. §10. We suppose that f is 
of class C* in terms of the coordinates (2). We assume that f is non-degenerate, 
that is that the Hessian of f vanishes at no critical point of f. We assume that 
R is compact. It follows that the number of critical points of f is finite. Let 





1 Morse, M. and van Schaack, G. B., The critical point theory under general boundary 
conditions, Annuals of Mathematics, 35 (1934). 
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(a) be a critical point of f. By the index of (a) is meant the index” of the quad- 
ratic form Q whose coefficients form the Hessian of f at (a). The index is the 
number of minus signs in the usual canonical form for Q. We see that there 
are n + 1 possible values for the index. This is the simplest form of classifi- 
cation of critical points. It corresponds in dynamics to various degrees of 
instability at the point of equilibrium. With this understood we state the 
following theorem. 

TuEorEM 1. The numbers p; of critical points of index i of the non-degenerate 
function f, and the connectivities R; of R satisfy the following relations: 


po = Ro 
Pi — Po = ki — Ro 
(0.2) P2 —- Pr +p = Re — Ri + Ro 


a 


Pn — Prat °° +(—1)"po = Rn — Raia +--- + (—1)"Ro. 


Upon considering the successive relations in (0.2) one finds that 


(0.3) p 2 R; (@ = 1, ---,n). 
Relations (0.3) imply the existence of what we may call the topologically 
necessary critical points. If, for example, R were a torus the relations (0.3) " 


would imply that p 2 1, p:. 2 2, and pe 2 1. The relations (0.2) however 
contain much more. They imply relations between the numbers p; in excess 
of those which are topologically necessary. If one is dealing with the problem 
under general boundary conditions there is a sense in which the preceding 
relations are complete. See M1, p. 145. We shall leave the non-degenerate 
problem for the present. 

The analogue of the critical point in the calculus of variations is the extremal 
satisfying given boundary conditions, for example joining two points. The 
definition and evaluation of the corresponding indices of these critical points 
have been taken up in M1. Studies of this sort are not the main object of 
this memoir. 

Instead we recall that the theory of functions and functionals can be unified 
by the study of functions f of a point on an abstract space. Sufficient generality 
will be obtained if the functions f are lower semi-continuous and the space is a 
general metric space. See §1. The curves admitted in the calculus of vari- 
ations become points in this abstract space. The theory as applied to functions 
and to functionals would still be fairly divergent were it not for the fact that 
tis possible to introduce a topological definition of a critical point. In the 
special case of a function which is continuous in the neighborhood of a point 
a) of a coordinate space (x), this definition is as follows: The point (a) is non- 
mueal if there exists a continuous deformation of the domain f S f(a) neighboring 





*See Morse, M., The calculus of variations in the large, American Mathematical Society 


Colloquium Publications (1934). We shall refer to these lectures as M1. 
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(a) under which f is decreased whenever a point p is displaced, and under which 
the point (a) is actually displaced. 

By far the greatest difficulty and the greatest interest arise when one admits 
the cases where the number of critical values is not finite. The corresponding 
numbers m; and R; will then in general be infinite. They can be defined as 
cardinal numbers, but the real generalization lies deeper. It depends upon the 
fact that the entities p; are merely dimensions of groups M; and that these 
groups can be compared at most superficially by comparing their dimensions. 
The real comparisons are by means of isomorphisms termed “natural” because 
they are limited in character by the nature of f. In this part of the theory 
we shall find that the relations (0.2) are replaced by an infinite set of relations 
of the same formal character, with the numbers p; and R; replaced by groups, 
and the signs + and — replaced by suitably defined operations in abelian group 
theory of the nature of reductions with respect to a group modulus. The 
relations (0.2) in the non-degenerate case are a very special case. Even in 
the non-degenerate case the group theory relations contain new general results. 

An extensive bibliography cannot be given here. The reader may however 
turn to M1 and to the papers’ cited below for further references. References 
will be found in M1 to Birkhoff, Brown, Fréchet, Lefschetz, Hadamard, Menger, 
Poincaré, Schnirelman and Lusternik, Tonelli, and others. 

The results here presented form a part of a set of lectures given at the In- 
stitute for Advanced Study at Princeton during the year 1935-36. Among 
those who were present and made helpful suggestions were Drs. Reinhold Baer 
and E. Cech. See §5. The author’s assistant, Dr. Pitcher, rendered valuable 
service in the reading and preparation of the manuscript. 


I. THe UNperRtyinG Torpotocy AND Group THEORY 


1. The space M and function F. Let M bea space of elements p, 9, 7, °°’ 
in which a number pq is assigned to each ordered pair of points such that 


I. pp = 0, 
Il. pq ¥ Oif p ¥ gq, 
Ill. pr S pq + r9@. 





* Some of the more recent references are as follows: : 

Birkhoff, G. D. and Hestenes, M., Generalized minimaz principle in the calculus of vari- 
ations, Duke Mathematical Journal, 1 (1935), 413-432. 

Lefschetz, 8., On critical sets, Duke Mathematical Journal, 1 (1935), 392-412. 

Menger, K., Untersuchungen tiber allgemeine Metrik, Mathematische Annalen, 100 (1928), 
75-163; 103 (1930), 466-501; Metrische Geometrie und Variationsrechnung, Fundamenta 
Mathematicae, 25 (1935), 441-458. ‘ 

Morse, M. and van Schaack, G. B., Abstract critical sets, Proceedings of the National 
Academy of Sciences, 21 (1935), 258-263. ; 

Morse, M. and van Schaack, G. B., Critical point theory wnder general boundary condi- 
tions, Duke Mathematical Journal, 2 (1936). 

Morse, M., Functional topology and abstract variational theory, Proceedings of the Na- 
tional Academy of Sciences, 22 (1936). , 

Lusternik, L. and Schnirelman, L., Méthodes topologiques dans les problémes variationnels, 
Paris, Hermann & C*. 
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Upon setting r = p in III we see that pg 2 0 and upon setting p = q that 
gr = rg. The space M is termed a metric space. It is called symmetric 
since gr = 7g. The elements p, qg, 7, etc. are termed “points” and pq the 
“distance” from p to q. 

Neighborhoods, limit points, sets relatively open or closed, can now be 
defined in the usual way.* In particular if e is a positive number the e-neighbor- 
hood A, of a point set A shall consist of all points p with a distance from A 
less than e. A set B C M will be said to be compact if every infinite sequence 
in A contains a Cauchy subsequence which converges to a point in A. We 
do not assume that the space M is compact. 

Let f(p) be a real single-valued function of the point p on a subset A of M. 
We include +2 among the values which f may assume. We say that f is 
lower semi-continuous at a point g of A if corresponding to each constant c < f(q), 
there exists an e-neighborhood q, of q such that f(p) > c on the intersection 
A-q, of A and q.. Upper semi-continuity of f at q is similarly defined, re- 
versing the preceding inequalities. We say that f(p) is continuous at q if its 
value at q is finite and f(p) is both upper and lower semi-continuous at q. 

We shall consider a function F(p) lower semi-continuous on M. We shall 
suppose that 


(1.1) 0sFsz1 


and that for any constant c such that 0 S c < 1 the domain F <S c is compact. 

The functions f which we shall encounter in our abstract variational theory 
will be bounded below and lower semi-continuous on M. They may take on 
the value +0. For each finite constant b the domain f < b will be seen to be 
compact. Such functions f lead to functions F as follows. Let m be the abso- 
lute minimum of fon M. Sety = f — mand 


1. pre 4 

(1.2) F, rx; 

understanding that F; = 1 when gy = +. The function F; defined by (1.2) 
is readily seen to have the properties ascribed to the function F. 

The set of points at which F < aor F S a will be denoted respectively by 
F,or Fy. If A is an arbitrary point set of M we shall say that A is “definitely 
om” F, (written on, Fa) if F is less than a constant c < a at points of A. We 
note that A may be on F, without being on, F, even when A is closed. To 
illustrate this consider the following example. 

Example 1.1. Let M be the curve 


(13) + (y—4) =4 


and suppose that F = y at each point (z, y) of M except at the point (0, 1). 
At (0, 1) let F = 0. The space M is on F < 1 but not on, F < 1. 


SS 
‘See Hausdorff, F., Mengenlehre, Leipzig, Walter de Gruyter (1927). 
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2. Vietoris cycles. We shall use Vietoris’ cycles in our topology. Singular 
cycles in the classical sense are inadequate in a number of ways. We shall 
illustrate this fact by an example. Let V be a compact subset of M and V, 
the e-neighborhood of V. Let wu be an arbitrary singular k-cycle not on V. 
Corresponding to each positive e we suppose that u is homologous to a cycle 
on V,. It is not always true that u is homologous to a cycle on V as examples 
would show. The corresponding theorem for Vietoris cycles however is true. 

We proceed with a systematic outline of the Vietoris theory generalized and 
modified to meet our needs. 

Let (A) be a set of k + 1 points of M. For k > 0 the orderings of (A) will 
be divided into two classes, any ordering of one-class being obtained from any 
other of the same class by an even permutation. The set (A) taken with one 
of these classes of orderings will be termed a positively oriented k-cell and taken 
with the other class a negatively oriented k-cell. An oriented k-cell a, may 
be represented by a succession 


(2.0) n Ag -++ Ax 
of its vertices preceded by », where 7 = 1 or —1 according as the ordering 
Ao +: Ax belongs to a, or not. In case k = 0 we use (2.0) to define an oriented 


0-cell understanding the cell is positively or negatively oriented according 
as nis 1 or —1. We say that the oriented (k — 1)-cell 


n(—1)* Ao ++ Ag a-Aigs +++ Ag 


is positively related to a. We shall say that a; is of norm e if the distances 
between the vertices of a, are less than e. 

The ce’l a, will be termed degenerate or null if at least two of its vertices are 
coincident. Let 5 be an element in an arbitrary field’ A. By a k-chain of 
norm ¢ is meant a symbolic’ sum 2 of the form 6‘a‘, i = 1, «+: , m, in which 
5‘ is an element of A and a’ is an oriented k-cell of norm e. The chain 2 will 
be termed reduced if none of the a’ are null, if each is positively oriented, and 
no two of the a’ are identical. 

An arbitrary chain will be reduced as follows. Let a be an arbitrary posi- 
tively oriented k-cell and b the corresponding negatively oriented k-cell. Any 
term of the form 6b in = will be replaced by —éa. All terms involving a 
will be replaced by a single term 6’a in which 6’ is the sum of the coefficients 6 
in the terms replaced. Finally all terms involving null cells or null coefficients 
will be dropped. The resulting reduced chain will be regarded as formally 
equal to the original chain 2. 





5 Vietoris, L., Uber den héheren Zusammenhang kompakter Réume und eine Klasse von 
zusammenhangstreuen Abbildungen, Mathematische Annalen, 97 (1927), 454-472. 

See also Lefschetz, 8., Topology, American Mathematical Society Colloquium Publi- 
cations, New York (1930); and Deformations, Duke Mathematical Journal, 1 (1935). 

® See van der Waerden, B. L., Moderne Algebra, p. 41, Berlin, Julius Springer (1930). 
7 We adopt the summation conventions of tensor analysis. 
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By the sum of two chains 6‘a‘ and 6‘ a" is meant the chain 3‘a‘ +6'a". We 

assume that 

8[5'x"] = (68')z*. 
It is readily seen that k-chains of norm e form an additive abelian operator 
group.” 

We shall now define the boundary operator 8. If a is an arbitrary oriented 
k-cell with k > 0, and e is the unit element in A, ea will be a k-chain and A(ea) 
shall be the (k — 1)-chain >; eb; where b; is an arbitrary (k — 1)-cell positively 
related toa. More generally we set 


B(6‘a") = 8'[B(ea’)). 


If k = 0 we understand that B(ea) = 0. If u is an arbitrary k-chain one sees’ 
that BBu = 0. One terms Bu the boundary chain defined by u. 

The preceding k-chains are termed algebraic k-chains to distinguish them 
from Vietoris chains to be defined later. The term algebraic will be omitted 
when it is clear from the context that the chain is algebraic. In particular 
this will be the case whenever the norm e of the chain is mentioned. 

Let B and C be compact subsets of M such that B C C. An algebraic 
k-chain u will be termed a cycle modulo B on C if Bu is on B. We term u an 
absolute k-cycle or a relative k-cycle according as B is or is not null. The 
adjective absolute will ordinarily be omitted. We shall term u, e-homologous 
to 0 modulo B on C and write 


u~,.0 (mod B on C) 


if there exists a (K + 1)-chain z of norm e on C such that Bz = u + v, where 
visa k-chain of norm e on B. If B = 0, then v = 0 and the phrase mod B 
is omitted. 

Let u = (Un) be a sequence of algebraic k-cycles u,,n = 0, --- ,mod BonC, 
with norms at most é, respectively. If the numbers e, tend to zero as n becomes 
infinite, and if for each integer n there exist “connecting” homologies of the form 


(2.1) Un ~e, Unyt (mod B on C), 


wis termed a (Vietoris) k-cycle mod B on C, and C a carrier of u. We term u 
homologous to zero mod B on C, and write u ~ 0 mod B on C if corresponding 
to each positive number e there exists an integer N so large that u, ~. 0 
mod Bon C whenever n> N. The set C is termed a carrier of the homology 
“~Omod Bon C. Vietoris k-cycles u and v are termed homologous, u ~ 2, 
mod B on C, if wu — » ~ 0 mod B on C. (See below for definition of u — v.) 


The algebraic k-cycles u, will be termed the components of u = (un). If * 


“and v are Vietoris k-cycles mod B on C the algebraic k-cycles u, + 0, are 


the components of Vietoris k-cycle which we denote by u + v. If 6 is an ele- 
Sse eeeeeieccensninenties 


tee van der Waerden, l.c. pp. 132-144. 
See Seifert-Threlfall, Lehrbuch der Topologie, p. 60, Leipzig, Teubner (1934). 
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ment in the field A the algebraic k-cycles du, are the components of Vietoris 
k-cycle which we denote by 6u. 

In the remainder of this memoir the term k-cycle mod B on C shall mean a 
Vietoris k-cycle mod B on C unless otherwise stated. 

If (un) is a k-cycle mod B on C any infinite subsequence (v,) of (un) defines 
a k-cyclev ~umod BonC. With this understood let ¢ be an algebraic k-cycle 
mod B on C of norm e such that u, ~. ¢ mod B on C for all integers n exceeding 
some integer VN. We then write 


(2.2) ume (mod B on C). 


By virtue of these conventions we have the following lemma. 
Lemma 2.1. If u = (un) and v = (vn) are k-cycles mod B on C such that for 
every n, 


(2.3) Un en Un (mod B on C) 


where e, tends to zero as n becomes infinite, then u ~ v mod B on C. 

In the preceding paragraphs we have spoken of algebraic and Vietoris k- 
cycles and homologies mod B on C where B and C were compact. If the phrase 
mod B on C is replaced by mod B’ on C’ where B’ and C’ are compact or not 
at pleasure, mod B’ on C’ shall mean mod B” on C”’, where B” and C” are 
respectively compact sets on B’ and C’. 

Letting X, denote the e-neighborhood of X we state the following lemma. 

Lemma 2.2. If u = (un) is a k-cycle mod B on C such that u ~ 0 mod B, 
on C. for every positive e, then u ~ 0 mod B on C. 

Corresponding to an arbitrary positive e there exists by hypothesis an integer 
N so large that forn > N, un ~. 0 mod B, on C,.. For such values of n we 
infer the existence of an algebraic (k + 1)-chain w, on C, of norm e and an 
algebraic k-chain v, on B, of norm e such that 


Bw, = Un + Vn. 
Let v,, be the algebraic k-chain obtained by replacing each vertex of v, not 
on B by a nearest point on B. It is easily seen that there exists an algebraic 


(k + 1)-chain z, on B, of norm 3e such that Bz, = vn — Un. If we set w, = 
Wn + Zn we find that 


/ / 
Bw, = Un + Vn. 


We now replace each vertex of w,, on C, but not on C by a nearest vertex on C 
obtaining thereby a (kK + 1)-chain y, on C of norm at most 5e. We have the 
relation ‘ 


BYn = Un + vr 
and since v, is on B, uw ~ 0 mod B on C. 
3. Theorems on Vietoris k-cycles. Let g be an arbitrary additive abelian 


operator group with coefficients in a field A. With such a group there can be 
associated in many ways a maximal linear set of elements of g, that is a set 8 
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of elements of g no proper linear combination of which with coefficients in A 
is null, and which is a proper subset of no other such set. The cardinal number 
m of the set S equals that of any other maximal linear set of g. This is readily 
proved in case m is finite. It is true even when m is infinite but we shall make 
no use of this fact. We term m the dimension of g. 

Let V be a compact subset of M. We are dealing with Vietoris k-cycles. 
If u were an ordinary singular k-cycle homologous to zero mod V it is clear 
that uw would be homologous to a singular k-cycle on V. If however wu is a 
Vietoris k-cycle the corresponding theorem requires proof. Theorems 3.1, 
3.2, and 3.3 are general theorems of this nature. If the field A consisted merely 
of the integers mod 2, use could be made of the compactness of the relative 
homology groups to prove these theorems. But with our general group A the 
proof is more difficult. 

The basic ideas appear in two lemmas whose proof was communicated to 
the author in slightly different form by Professor E. Cech. Professor Cech 
had previously developed similar lemmas and theorems in his abstract theory 
of spaces.” 

We begin with a definition. 

Reduction sets W. Let U C V be compact subsets of M. Corresponding 
to each positive 6 let W(6) be a group of algebraic k-cycles mod U on V of 
norm 6, with the property that for each positive » < 6, W(n) is a subgroup of 
W(5). Such a set of groups W(6) will be called a reduction set W. 

The first lemma here is as follows. 

Lemma 3.1. Corresponding to an arbitrary positive constant e there exists a 
positive constant 6 < e such that to each cycle w(6) in W(6) there corresponds for 
every n < 6 at least one cycle w(n) in W(n) such that 


(3.1) wld) ~. w(n) (mod U on V). 


let en =e3 ". Let w denote the subgroup of cycles of W(e:) which are e- 
homologous to zero mod U on V. Let h,; denote the group W(e:) mod w. We 
shall begin with a proof of statement (a). 

(2). The dimension of h; is finite.” 

Let A be a finite net of points on U within a distance e; of each point of U. 
let BD A be asimilar net for V. Let g denote the group of algebraic k-cycles 
mod A on B. It is clear that the dimension of g is finite. 

Corresponding to an arbitrary algebraic k-cycle w = w(e:) of W(e:) we can 
obtain an algebraic k-cycle u of g of norm e = 3e; by replacing each vertex of 
von U by a nearest point of A and each vertex of w not on U by a nearest 
point of B. We infer that w ~, u mod U on V. But the group g admits a 
finite base with elements z‘ so that u = 6'z‘ and hence 

wr, dz (mod U on V). 


a 


a tiesistteticesiiiaa 


Cech, E., Théorie générale de Vhomologie dans un espace quelconque, Fundamenta 
athematicae, 19 (1932), 149-183. 
" Cf. Vietoris, l.e., p. 461 (3). 


M. 
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It follows that the dimension of h; is at most the number of elements z‘, and 
hence is finite. 

The proof of (a) is complete. 

Recall that hi; is a group of classes of k-cycles. For each integer n > | 
let h, be the subgroup of those classes of h; which contain at least one cycle 
of Wen). We see that 


hiD he D hs D 
There must accordingly exist a finite integer r such that 
(3.2) dim h, = dim hyp41 = +--. 


But two operator groups with coefficients in a field and with equal finite dimen- 
sions will be identical if one group is a subgroup of the other. Hence 


(3.3) h, = Arai = °°. 


The constant e was arbitrary and e, = e3 ". We now let 6 be any positive 
constant such that 6 < e;.. The cycle w(6) of the lemma is in W(e,) and hence 
in some class of h,. Corresponding to the constant » < 6 of the lemma let p 
be an integer so large thate, < 7. Thene, < 6 < e,sothat p > randh, =h,. 
The cycle w(6) is in a class of h, and hence of h, , and by virtue of its definition 
this class of h, contains at least one k-cycle w(e,) of W{e,). Since e, < 7 
the cycle w(e,) is a cycle w(n). The cycles w(6) and w(n) are in the same class 
of h, and hence of h;. The relation (3.1) is satisfied by virtue of the definition 
of hy 

The proof of the lemma is complete. 

We now define ensembles Z closely related to the reduction sets W. 

Ensembles Z. Corresponding to each positive 6 let Z(6) be a non-null 
class of algebraic k-cycles z2(6) mod U on V of norm 6. The ensemble of the 
classes Z(5) as 6 ranges over all positive values will be denoted by Z and assumed 
to have the following properties: 

(1). For an arbitrary » < 6, Z(n) shall be a subset of Z(6). 

(2). With Z there shall be associated a reduction set W such that if 2()) 
is an arbitrary element of Z(6) any other element of Z(é) is of the form 


(3.4) z(5) + w(6) [w(5) C Wd), 


and every sum of the form (3.4) is an element of Z(6). 

We shall illustrate the ensembles Z and reduction sets W by the following 
example. 

Example 3.1. Let UC V CC be compact subsets of M. Let u be a Vietoris 
k-cycle mod U on C homologous to zero mod V on C. Let Z(6) and W(5) 
consist respectively of the algebraic k-cycles z(6) and w(é) mod U on V of 
norm 6 such that 


(3.5) 2(6) ~s u, w(s) ~; 0 (mod U on (). 
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That there exist algebraic k-cycles 2(6) in (3.5) may be seen as follows. Since 
1 ~ 0 mod V on C there exists a sequence of positive constants e, tending to 
zero, and algebraic chains un41 of norm e, on C with algebraic k-chains z, of 
norm é, on Y such that 


BUnit _ Un * Zn- 


Hence Bu, = Ben. It follows that z, is an algebraic k-cycle mod U on V and 
that 
Zn ~en Un (mod U on C). 


The classes Z(6) are accordingly not empty. 

The remaining conditions on ensembles Z and reduction sets W are easily 
seen to be satisfied. 

We shall now prove the following lemma. 

Lemma 3.2. Corresponding to an admissible ensemble Z there exists a Vietoris 
k-cycle (v,) mod U on V whose components v, belong to Z(e,) for suitable positive 
constants e, approaching zero as n becomes infinite. 

We recall that there is a reduction set W associated with Z. Beginning 
with an arbitrary positive number é we shall give an inductive definition of a 
sequence of positive numbers e, tending to zero as n becomes infinite. The 
constant e of Lemma 3.1 is arbitrary. In particular we can set e = é@n_;. 
Lemma 3.1 then affirms the existence of a constant 6 < e. We take e, < 6. 
We suppose also that e, tends to zero as n becomes infinite. 

By definition of Z there is an algebraic k-cycle z, mod U on V in Z(e,) for 
each n. We shall give an inductive definition of a Vietoris k-cycle v = (vn) 
mod U on V. We begin by setting v, = z,. Suppose that the components 
v;,1 = 1, --- , m, have been defined in such fashion that v; is in Z(e;) and 


Vin Me, V5 (mod U on V;i = 1,---,m-— 1). 


We shall now define v»4:. First recall that zm41 — Um is in W(e,) in ac- 
cordance with property (2) of ensembles Z. We can apply Lemma 3.1 with 
€ = m1, With @m <6 < e and with 

w(d) = Zmi1 — Um- 


Setting » = en4i1 Lemma 3.1 affirms the existence of a cycle w(n) in W(em4+1) 
such that 


(3.6) w(d) ~e,,-, w(n) (mod U on V). 
We set 
(3.7) Um+1 = Zm41 — W(n). 


We hote that 0m: is in Z(ém41) by virtue of property (2) of ensembles Z. Upon 
adding the respective members of (3.6) and (3.7) we find that 


Guta “eo, Un (mod U on V). 


The algebraic k-cycles v, are accordingly the components of a Vietoris k-cycle v. 
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The proof of the lemma is complete. 

We shall prove three theorems which parallel facts obvious in the theory 
of singular cycles. Letting U GC V CC be three compact subsets of M we 
state the first theorem as follows. 

TueoreM 3.1. If u is a k-cycle mod U on C homologous to zero mod V on C 
then u is homologous mod U on C to a k-cycle z mod U on V. 

This theorem follows at once from Lemma 3.2 with the choice of the ensemble 
Z that of Example 3.1. 

If w and u are two Vietoris k-cycles we shall find it convenient at times to 
write w = umod A. We mean thereby that w, = un + v, where »v, is a chain 
on A andn = 1,2,°-::. 

Letting A C U C V be three compact subsets of M the second theorem is 
as follows. 

THEOREM 3.2. If a (k —1)-cycle u mod A on U is homologous to zero mod A 
on V there exists a k-cycle v mod U on V such that Bv = u mod A. 

This theorem also follows from Lemma 3.2 We take Z(é) as the class of 
all k-eycles z(5) mod U on V of norm 6 such that 


Bz(6) ~s u (mod A on U). 


The cycles 2(5) exist by virtue of the hypothesis of the theorem. The corre- 
sponding reduction group W(6) consists of all k-cycles w(é) mod U on V of 
norm 6 such that 


Bw(s) ~; 0 (mod A on U). 


With this choice of Z it follows from Lemma 3.2 that there exists a k-cycle 
z mod U on V such that Bz ~ u mod A on U. There accordingly exists a 
sequence of positive constants e, tending to zero, and algebraic (k + 1)-chains 
w, of norm e, on U with algebraic k-chains z, of norm e, on A such that 


Bwn — iy B2n + Xn 


If we set Un = Wn + Zn we see that (vn) is a Vietoris k-cycle v mod U on V 
and that 


bv =u (mod A). 


The proof of the theorem is complete. 

The third theorem is as follows. 

THEOREM 3.3. If u is a k-cycle mod A on V and Bu ~ 0 on A, there exists 4 
k-cycle v on V such that u ~ v mod A on V. 

Before proving this theorem it will be convenient to establish the following. 

(a). If x is an algebraic k-cycle x of norm e mod A on V, and Bx ~. 0 on A, 
there exists an algebraic k-cycle y on V such that x ~,. y mod A on V. 

It follows from the hypothesis in (a) that there exists an algebraic k-chain 
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of norm ¢ on A such that Bw = Br. We set y = x — w and note that y is a 
k-cycle on V of norme. To establish (a) we observe that 


B0 = —w +o = (y — 2) +4, 


so that zs ~~. y mod A on V. 

To prove the theorem we shall apply Lemma 3.2 setting U = 0. The set 
7(6) shall be the class of all k-cycles z(6) of norm 6 on V such that z(5) ~; u mod 
Aon V. The corresponding group W(é) will consist of all k-cycles w() of 
norm 6 on V such that w(é) ~;0 mod A on V. That the set Z(6) is not empty 
follows from (a), and the theorem follows from Lemma 3.2. 


4. The rank conditions. In this section we shall define an F-non-bounding 
k-eycle and assign to such a cycle a rank r. These ranks are fundamental. 
They lead to a group theoretic formulation of the structure beneath our theory. 
They are introduced here for the first time. 

Recall that the sets of points at which F < c and F & ¢ are respectively 
denoted by F, and F.,. The phrase ono F, of §1 will sometimes be replaced 
by the phrase below. c. Mod, F, shall mean mod some compact set below, c. 

If zis a k-cycle below, b, not homologous to zero below, b we term b a homology 
limit of z, and z F-non-bounding. By the superior cycle limit s(z) of an F-non- 
bounding k-cycle z is meant the least upper bound of the homology limits of z. 
If zis non-bounding on M we set s(z) = +. It follows from the definitions 
involved that s(z) is a homology limit of z. 

Let z be a k-cycle with superior cycle limit s(z) = s. By the inferior cycle 
limit t = t(z) of z is meant the greatest lower bound of constants c such that 
z2~0 mod, F, below, s. We do not define ¢(0). We observe that z ~ 0 mod, 
F, below, s. 

If z is an F-non-bounding k-cycle we term the pair [s(z), ¢(z)] the rank r(z) 
of z. We order the pairs [s, ¢] lexicographically. That is, we write [s, t] = 
[s', ’] if and only if s = s’ and t = ?’, and write (s, t) > (s’, t’) if s > s’, or if 
s=s’andt >’. In this way we are led to a simply ordered set of ranks r. 
These ranks are fundamental. 

A bounding k-cycle which is F-non-bounding will be termed ambiguous. 
The superior cycle limit of such a cycle will be finite. , 
In §5 we shall give the theory of ranks an abstract formulation and we have 
there listed four conditions which we term rank conditions. These rank condi- 
tions are satisfied by the ranks r(w), by the superior cycle limits s(u), and by 
several other basic ordered sets. Our first theorem in this connection is the 

following. 

TuroreM 4.1. The superior cycle limits s(u) of F-non-bounding k-cycles 
salisfy the rank conditions I, II, III, IV with p(u) replaced by s(u). 

I. To show that s(éu) = s(u) for 6 ~ 0 we must show that the condition 
"~ 0 below, b is equivalent to the condition 6u ~ 0 below, b. The first condi- 
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tion clearly implies the second. But the second condition also implies the first. 
For any coefficient in the field A can be divided by a non-null 6. In particular 
if 6 ~ 0 the coefficients in any sequence of algebraic bounding relations which 
imply that 6u ~ 0, can be divided by 6. Thus I holds as stated. 

II. To establish II we suppose that s(u) and s(v) are both finite. The 
validity of II when s(u) or s(v) is infinite is trivial. Let e be an arbitrary posi- 
tive constant. By virtue of the definition of a superior cycle limit, u ~ 0 
below, s(u) + e and v ~ 0 below, s(v) + e. Hence u + »v ~ 0 below, s* + ¢ 
where s* is the maximum of s(u) and s(v). Property II follows as stated. 

III. To establish III we suppose that s(u) > s(v) and let b be any homology 
limit of w between s(u) and s(v). Observe that v ~ 0 and u ~ 0 below, b so 
that u + v ~ 0 below, b. Thus b is a homology limit of wu + », and III holds 
as stated. 

IV. We set po = so. + Observe that u and v in IV are below, s). Since s(u) 
and s(v) do not exist u ~ 0 and v ~ O below, so. If s(u + v) exists it must 
be less than so , and IV is established. 

The proof of Theorem 4.1 is complete. 

It might be thought that the inferior cycle limits t(u) also satisfy the rank 
axioms. To show that this is not the case and to illustrate the preceding 
theorems we present several examples. 

Let the metric space M be the curve 


(4.1) y= —e’ sing (0 < x S 4n) 


in the (z, y) plane. On M let F = y. The function F is continuous on M. 
We waive the condition 0 S F < 1. We shall denote the points of M at which 
xz = 0, 7, 2x, and 4z by a, b, c, e respectively, and the points at which « equals 


4’ Srxnt 1y 


respectively by n, p, g. The reader will do well to trace the curve M and note 
that n, p, q are relative extrema of F. We observe that 


F(p) < F(@q) < F(n). 


Let A, B, C, E be 0-cycles mod 2 whose components are respectively identical 
with a, b, c, e. 

Example 4.1. The cycle B — C has the superior cycle limit F(n) while 
B — A has no homology limit. 

Example 4.2. In this example we shall show that cycles wu and v may have 
homology limits while uw + v has no homology limit. To that end let 


u=C-A, v= B-C, ut+tv=B-A. 


We observe that s(u) = s(v) = F(n) while s(u + v) does not exist. 
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Example 4.3. We here show that the equality is possible in rank axiom II 
as applied to s. We set 


u=B-C, v=C-E, 
and note thatu += B—E. We find that 
s(u) = siu+v) = F(n), — s(v) = FY) < s(u). 


Example 4.4. We shall show that the sign< is possible in rank axiom II. 
To this end we make use of the cycles u and v in Example 4.3 and set u’ = u + », 
y' = —u. We observe that u’ + vo’ = v. Referring to Example 4.3 we find 
that 

s(u’ + v’) < max [s(u’), s(v’)]. 


In the next three examples we shall show that for inferior cycle limits each 
one of the three cases 


(42) t(u + 0) S max {t(u), t()] 


is possible. 

Example 4.5. We shall first show the possibility of the sign > in (4.2). 
We make use of the space M and function F of the preceding examples and set 
u=B-—C,v = E — B. We observe that u + v = E — C and that 


s(u) = s(v) = F(n), s(u + v) = FQ), 
t(u) = t(v) = F(p), tu +v) =0> tu) = tr). 


Example 4.6. We here illustrate the possibility of the equality in (4.2). 
let the space M consist of two points a and b. Let the function F be 1 at a 


and 0 at b. Let w and v be two 0-cycles mod 2 with components respectively 
identical with a and b. We find that 


s(u) = s(v) = stu +0) = +0, 
tu) = ttu+v) = 1, t(v) = 0. 


Example 4.7. We shall show finally that the sign < is possible in (4.2). 
We make use of the space M and function F of the preceding example, and set 
?=v—u. We find that 


t(z) = t(u) = 1, t(u +z) = 0. 


We return to the general theory and state a basic theorem. 

TazoreM 4.2. The ranks r(u) of F-non-bounding k-cycles satisfy the four rank 
conditions of §5, r(u) replacing p(u). 

Let b be a homology limit of a k-cycle u. Let I[u, b] denote the greatest 
lower bound of constants c such that u ~ 0 mod, F, ong Fp. 

We state the following lemma. 
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Lemma 4.1. If b is a homology limit of u, v and u + », then 
(4.3) I{u + v, b] S maz [I(u, b), I(v, b)]. 
If b is a homology limit of u and of v and if 
I(u, b) > I(v, 6), 
then b is a homology limit of u + v and 
I(u + v, b) = I(u, bd). 


The lemma follows from the definitions involved. 

We shall establish Theorem 4.2. 

I. That t(6u) = t(u) if 6 ¥ 0 follows as in the proof that s(éu) = s(u). Hence 
r(éu) = r(u). 

II. To establish II we may assume without loss of generality that r(v) < r(u). 
We have then to prove that r(u + v) S r(u). According to Theorem 4.1 we 
have two cases: 


(a) s(u+v) < s(u). 
(b) s(u+v) = s(u). 


In case (a), r(u + v) < r(u) by virtue of the ordering of pairs (s, ¢). In case 
(b) we distinguish between the two subcases: 

(b)’ s(v) < s(u). 

(b)”  s(u) = s(v). 

In case (b)’ we observe that v ~ 0 below, s(u) so that u + v ~ u below, s(u). 
Hence t(u + v) = t(u). Thus r(u + v) = r(u) in case (b)’, and II is true. 

In case (b)’”’, u, v, and wu + v have common superior cycle limits. It follows 
from the preceding lemma that 


(4.4) t(u + v) S max [f(u), t(v)]. 


Hence r(u + v) is at most the maximum of r(u) and r(v), and II is established. 

III. We here show that r(u + v) exists, provided r(u) and r(v) exist and are 
unequal. We assume without loss of generality that r(u) > r(v) and consider 
the two cases: 


(a) s(u) > s(v). 

(8) s(u) = s(v), t(u) > t(v). 

In case (a) it follows from Theorem 4.1 that s(u + v) exists, and in case 
(8), s(u + v) exists by virtue of the preceding lemma. Hence r(u + ») exists 
in both cases, and III is proved. 

IV. If the cycles u = Zu; and v = Xv; in IV have no rank, s(u) and s(v) 
fail to exist. But r(wu + v) and hence s(u + v) exist by hypothesis. We set 
po = To = (80, to). It follows from the fact that superior cycle limits satisfy I\ 
that s(u + v) < s, and hence r(u + v) < 7. Hence IV holds as stated. 

The proof of Theorem 4.2 is complete. 





wil 


suc 
stil 
ist 
or 








1ce 


we 


ASE 


(u). 


OWS 


ned. 
are 
ider 


case 
xists 


s(v) 
> set 
y IV 





FUNCTIONAL TOPOLOGY AND ABSTRACT VARIATIONAL THEORY 401 


5. The underlying group structure. The implications of the rank conditions 
will be best understood if incorporated in an abstract group theory. 

We suppose then that we have an additive abelian operator group G with 
operators which form a field A. We shall be concerned with various subgroups g 
of G. We suppose throughout that these subgroups are operator subgroups, 
that is if wis in g and 6 is in A, 6wising. Our isomorphisms shall be operator 
isomorphisms, that is, if w corresponds to v, éu corresponds to 6v. 

By a subgroup g of G with property A is meant an operator subgroup of G 
every element of which, with the possible exception of the null element, has 
the property A. A group g with property A will be termed mazimal if it is a 
proper subgroup of no other subgroup g’ of G with property A. The following 
example shows that there may be several maximal groups with a given prop- 
erty A. 

Let G be a group generated by three elements a, b, c with coefficients in the 
field of integers mod 2. Suppose that a, b, a + b, c are the only elements with 
the property A. Then a and b together generate a maximal group with prop- 
erty A as does c. 

Suppose that the respective elements u of G have the property A if and 
only if for each operator 6 ~ 0, du has the property A. Let g be a maximal 
(operator) subgroup of G with property A. Let v be an arbitrary element 
with property A. There will then exist an operator 6 ~ 0 and an element » 
in g, together with an element ve which fails to have the property A, or is null, 
such that 6v = v, + ve. Now division by 6 is permissible, v,/é is in g, and v2/6 
still fails to have the property A or is null. We can accordingly infer the ex- 
istence of an element z in g, and an element w which fails to have the property A 
or is null, such that 


(5.1) ve=2z2+uw. 


The rank conditions. Let (p) be a simply ordered set of elements, symbols, 
numbers, points, etc. With some but not all of the elements u of G we suppose 
there is associated an element p(u) in the set (p). We term p(u) the rank of wu. 
The rank p(0) shall not be defined. 

We assume that the ranks p satisfy the following conditions: 


I. If u has a rank and 5 # 0, p(u) = p(éu). 
Il. If u,v, and u + v have ranks, p(u + v) S maz [p(u), p(v)]. 
Ill. If wand v have unequal ranks, p(u + v) exists. 
IV. If uy, +++, Um, V1, °** , Un have ranks at most po while the sums u = Zu; 
and v = Xv; have no rank and u + v has a rank, then p(u + v) < po. 


The elements with rank (with the null element added) will not in general 
form a group. For example consider the group G of 0-cycles in Example 4.2 
letting the superior cycle limit s(z) be the rank of the 0-cycle z in case s(z) 
exists. In this example we have seen that s(u) and s(v) exist while s(u + ») 
does not exist nor does u +v=0. 

We have already remarked that the primitive concept of counting is to be 
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replaced by comparisons involving isomorphisms of groups. A comparison by 
means of isomorphisms in general would not be very effective. Our compari- 
sons will always be by means of isomorphisms of a special sort the abstract 
counterpart of which we now define. 

Two elements of G will be said to be in the same p-class if they have the 
same rank while their difference has no rank or a lesser rank. An isomorphism 
between two subgroups of G of elements with rank will be termed a p-isomorphism 
if corresponding non-null elements are in the same p-class. These p-isomor- 
phisms will be exemplified by three classes of natural isomorphisms defined by F. 

We state the following theorem. 

THEOREM 5.1. Any two maximal groups of elements of G with a fixed rank ¢ 
are p-isomorphic. 

Let g. be the group generated by the elements of G with rank at most «. 
Let H, be the subset of elements of g, without rank or with rank less than «. 
We continue by proving the following statement. 

(a). The elements of H, form a group. 

Let u and v be arbitrary elements of H,. Writing H for exists and ~ E 
for does not exist we distinguish four cases: 


(1) p(u) EZ, ple) E. (2) p(u) E, pv) ~ E. 
(3) plu) ~ E, pl) E. (4) plu) ~ E, pve) ~ E. 


We shall prove that u + visin H,. If p(u + v) does not exist wu + visin H,. 
We assume therefore that p(u + v) exists and seek to prove that p(u + v) < «. 
This will follow from the rank conditions II, III, II, IV respectively in the 
cases (1), (2), (3), (4). In case (1), p(w) and p(v) are less than o since u and v 
are in H,. It follows from II that p(u + v) < oc. In case (2), p(u) < o and 
p(u + v) must be less than o for otherwise p(v) would exist. Case (3) is similar. 
The result in case (4) follows from IV, and the proof of (a) is complete. 

We return to the theorem and let m, be a maximal group of elements of G 
with rank o. Each element of m, is in a coset of the residue group h, defined 
by g. mod H, , and different elements u and v of m, are in different cosets of h. , 
for otherwise u — v would be in H, and hence not in m,. Moreover there s 
an element of m, in each coset z of h, for otherwise an element u of z taken 
with m, would generate a group of elements of G with rank o containing m, a8 
a proper subgroup. There is thus an isomorphism between m, and h, in which 
each element u of m, corresponds to the coset of h, which contains u. We 
term m, and h, p-isomorphic since u corresponds to a coset of elements in the 
same p-class as u. 

Let m, be a second maximal group of elements of G with rank «. The group 
m, is also p-isomorphic with h,. It follows that m, and m; are p-isomorphic 
with each other, and the proof of the theorem is complete. 

A particular consequence of the theorem is that the dimensions of any two 
maximal groups of elements of G with rank o are equal, but the theorem of 
course implies much more than this equality of dimension. 
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We note another consequence of statement (a). 

(8). The property of elements with rank being in the same p-class is transitive. 

For if u, v, and w have the rank o and if wu is in the p-class of v and w, u—»v 
and u — w belong to H,. It follows from (a) that » — w belongs to H, so 
that v and w are in the same p-class. 

We state the following lemma. 

Lemma 5.1. If uw, -++ , Um are elements of G with ranks such that 


(5.2) p(t) > p(ui) (t = 2, --+ , m) 


then p(u: + +++ + Um) exists and equals p(u,). 

Suppose first that m = 2. By virtue of rank condition III p(w; + ue) exists. 
From I we see that p(—ue) = p(w). Since uw = (wu + we — we) we can infer 
from II and (5.2) that 


p(ui) S max [p(w + ue), p(ue)] S p(w + w), 
and that 


p(uy + Ue) S max [p(u1), p(ue)] S p(w) 


so that p(u:) = p(u: + we). The proof of the lemma can be completed by 
induction with respect to m. 

Let there be given a subgroup g of G and a set of subgroups h(a) of g, a being 
an enumerating index. The group g is said to be a direct sum 


g= 2X h(a) 


of the groups h(a) if each element u of g is a finite sum of elements from the 
groups h(a), and if there exists no relation of the form 


va, + a + Ua, = 0 


in which the a;’s are distinct and wa; is a non-null element from the group 
h(a). 

From this point on we shall frequently employ the hypothesis that the ranks 
p are well-ordered. We shall not mean thereby that the ranks can be well- 
ordered but that taken in their given orders they are well-ordered.” 

We state the following theorem. 

TuEoreM 5.2. If the ranks of elements of G are well-ordered a maximal sub- 
group g of the elements of G with rank is a direct sum 


(5.3) > h(p) 


of arbitrary maximal subgroups h(p) of elements of g with rank p, and such groups 
Mp) are maximal subgroups of elements of G with rank p. 

It follows from the preceding lemma that the groups h(e) admit a direct 
sum and that this direct sum g’ is a group of elements with rank. For if 


"i, *** , Um are non-null elements of different groups h(p), Lemma 5.1 affirms 
ee enethcoasonntemmneenennnndieaiinienadisliena 


* See Hausdorff, l.e., p. 55. 
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replaced by comparisons involving isomorphisms of groups. A comparison by 
means of isomorphisms in general would not be very effective. Our compari- 
sons will always be by means of isomorphisms of a special sort the abstract 
counterpart of which we now define. 

Two elements of G will be said to be in the same p-class if they have the 
same rank while their difference has no rank or a lesser rank. An isomorphism 
between two subgroups of G of elements with rank will be termed a p-isomorphism 
if corresponding non-null elements are in the same p-class. These p-isomor- 
phisms will be exemplified by three classes of natural isomorphisms defined by F. 

We state the following theorem. 

TuroreM 5.1. Any two maximal groups of elements of G with a fixed rank ¢ 
are p-isomorphic. 

Let g. be the group generated by the elements of G with rank at most o. 
Let H, be the subset of elements of g, without rank or with rank less than c. 
We continue by proving the following statement. 

(a). The elements of H, form a group. 

Let u and v be arbitrary elements of H,. Writing EF for exists and ~ £ 
for does not exist we distinguish four cases: 


(1) p(u) E, p(v) E. (2) p(u) E, pv) ~ EB. 
(3) plu) ~ E, p(v) E. (4) p(u) ~ EB, pv) ~ £. 


We shall prove that u+visinH,. If p(u + v) does not exist u + visinH,. 
We assume therefore that p(u + v) exists and seek to prove that p(w + 0) <o. 
This will follow from the rank conditions II, III, III, IV respectively in the 
cases (1), (2), (3), (4). In case (1), p(w) and p(v) are less than o since u and v 
are in H,. It follows from II that p(u + v) < o. Im ease (2), p(u) < o and 
p(u + v) must be less than o for otherwise p(v) would exist. Case (8) is similar. 
The result in case (4) follows from IV, and the proof of (a) is complete. 

We return to the theorem and let m, be a maximal group of elements of G 
with rank c. Each element of m, is in a coset of the residue group h, defined 
by g. mod H, , and different elements u and v of m, are in different cosets of he , 
for otherwise u — v would be in H, and hence not in m,. Moreover there s 
an element of m, in each coset z of h, for otherwise an element wu of z taken 
with m, would generate a group of elements of G with rank o containing ™, 28 
a proper subgroup. There is thus an isomorphism between m, and h, in which 
each element wu of m, corresponds to the coset of h, which contains u. We 
term m, and h, p-isomorphic since u corresponds to a coset of elements in the 
same p-class as u. 

Let m; be a second maximal group of elements of G with rank ¢. The group 
m, is also p-isomorphic with h,. It follows that m, and mj are p-isomorphic 
with each other, and the proof of the theorem is complete. 

A particular consequence of the theorem is that the dimensions of any two 
maximal groups of elements of G with rank o are equal, but the theorem of 
course implies much more than this equality of dimension. 
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We note another consequence of statement (a). 

(3). The property of elements with rank being in the same p-class is transitive. 

For if u, v, and w have the rank o and if w is in the p-class of » and w, u—v 
and u — w belong to H,. It follows from (a) that v — w belongs to H, so 
that v and w are in the same p-class. 

We state the following lemma. 

Lemma 5.1. If us, +++ , Um are elements of G with ranks such that 


(5.2) p(t) > p(ui) (¢ = 2,--- , m) 


then p(t, + °+* + Um) exists and equals p(u,). 

Suppose first that m = 2. By virtue of rank condition III p(w, + up) exists. 
From I we see that p(—twe) = p(w). Since uw = (uw, + uw — we) we can infer 
from II and (5.2) that 


p(ux1) S max [p(ui + uw), p(ue)] S p(ui + w), 
and that 
p(uy + Ue) S max [p(u), p(ue)] S p(us) 


so that p(u:) = p(w: + we). The proof of the lemma can be completed by 
induction with respect to m. 

Let there be given a subgroup g of G and a set of subgroups h(a) of g, a being 
an enumerating index. The group g is said to be a direct sum 


g = L h(a) 


of the groups h(a) if each element u of g is a finite sum of elements from the 
groups h(a), and if there exists no relation of the form 


Ua, + ur + Uc, = 0 


in which the a,’s are distinct and u.,; is a non-null element from the group 
h(a;). 

From this point on we shall frequently employ the hypothesis that the ranks 
p are well-ordered. We shall not mean thereby that the ranks can be well- 
ordered but that taken in their given orders they are well-ordered.” 

We state the following theorem. 

TurorEM 5.2. If the ranks of elements of G are well-ordered a maximal sub- 
group g of the elements of G with rank is a direct sum 


(5.3) Dd h(o) 


of arbitrary maximal subgroups h(p) of elements of g with rank p, and such groups 
h() are maximal subgroups of elements of G with rank p. 

It follows from the preceding lemma that the groups h(p) admit a direct 
‘um and that this direct sum g’ is a group of elements with rank. For if 


1) *** , Um are non-null elements of different groups h(p), Lemma 5.1 affirms 
Sen ctears ees ceniemasieeidisiaipiasi 


~ See Hausdorff, Le., p. 55. 
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that the rank of their sum »v exists. Hence in particular v ¥ 0. It remains 
to prove that g’ = g. 

To that end let wu be an arbitrary non-null element of g. Set p(u) = g, 
Since h(c) is a maximal subgroup of elements of g with rank o there exists, in 
accordance with (5.1), an element »; in h(c) and an element w, not an element 
with rank a, such that u = », + w,. It follows from the rank condition II that 


Case I. Either p(w;,) fails to exist, 
Case II. Or p(w) < co. 


In Case I, w, = 0 since wy, = u — », andising. In Case II we treat u— », 
as we originally treated wu. 

Proceeding inductively we infer the existence of a-set of elements »;, i = 
1, 2, --+ , n, respectively with decreasing ranks p; , such that v; is in h(p;) and 


U=%1 + +++ +m + We 


where either p(w,) fails to exist or p(wn) < px». But a decreasing sequence of 
ranks is finite since the ranks are well-ordered, so that p(w,) fails to exist for 
some least integer n, say m. Since wp, ising, Wm = Oand u = y+ +--+: +o_. 
Thus g equals the sum (5.3) as stated. . 

To prove the final statement of the theorem let k(p) be any subgroup of 
elements of G with rank p, so chosen that k(p) D h(p). It follows from Lemma 
5.1 that the groups k(p) have a direct sum J which is a group of elements with 
rank. Moreover J > g. Since g is maximal we conclude that J = g. It 
follows that k(p) = h(p) since h(p) is maximal in g, and we conclude that h(p) 
is a maximal subgroup of elements of G with rank p. 

The proof of the theorem is complete. 

The following example shows that the omission of the hypothesis that the 
ranks are well-ordered in Theorem 5.2 would lead to a false conclusion. 

Example 5.1. Let G be generated by a countable infinity of symbols 
a, 4, +--+ with coefficients which are integers mod 2. We suppose that a, 
has a rank p, and that p; > pe > --- while the rank of a finite sum of different 
symbols a; equals the maximum of the ranks of the summands. Every ele- 
ment of G except the null element thereby possesses a rank, and the four rank 
conditions are satisfied. The element vp = a, + Qn41 taken with 0 forms a 
maximal subgroup (v,) of elements of G with rank p,. But the group G is 
not the direct sum > n(vn) since the element a; is not in this direct sum. 

Theorem 5.2 is complemented by the following theorem. 

TuEorEM 5.3. If the ranks of elements of G are well-ordered, the direct sum of 
arbitrary maximal subgroups of elements of G with the. respective ranks p 1s 4 
maximal subgroup of elements of G with rank. 

Let h(p) be a maximal subgroup of elements of G with rank p. As we have 
seen in the preceding proof the direct sum 


H = 2 h(o) 
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is permissible and defines a group of elements with rank. We seek to prove 
that H is maximal in G. 

To that end let g be a subgroup of elements of G with rank, with g > H. 
The group A(p) is maximal in G and hence in g. It follows from the preceding 
theorem that 


g = X h(p), 


and the proof of the theorem is complete. 
Note. The proofs of Theorems 5.2, 5.3 and 5.5 do not use rank condition IV. 
We shall prove the following theorem. 
TurorEM 5.4. If the ranks of elements of G are well-ordered any two maximal 
subgroups g and g’ of elements of G with rank are p-isomorphic. 
By virtue of Theorem 5.2 g and g’ are.direct sums 


9 = X g(r), g’ = Xu g'(0) 


of subgroups g(p) and g’(p) of g and g’ respectively, which are maximal sub- 
groups of elements of G with rank p. According to Theorem 5.1 g(p) and g’(p) 
are p-isomorphic. These isomorphisms between the respective groups g(p) 
and g’'{(p) imply a unique isomorphism 7' between g and g’ defined as follows. 
An arbitrary element u of the group g is of the form 


U=Utss + Un, 


where the elements u; are from different groups g(o;). If u; is the element of 
q'(o;) which corresponds to u;, then the element u shall correspond under T 
to the element 


, , / 
*™e=uwmt:-- +4. 


We shall show that 7 is a p-isomorphism. Without loss of generality we 
can suppose that 


ef. ee 


It follows from Lemma 5.1 that wu and wu’ are respectively in the p-classes of 
wand u;. But w and u; are in the same p-class. Hence u and w’ are in the 
same p-class since the property of elements being in the same p-class is transitive. 

The proof of the theorem is complete. 
; Extensions of the preceding theorems. If H is an arbitrary subgroup of @ it 
is readily seen that the ranks p of elements of H by themselves satisfy the four 
rank conditions. The preceding theorems accordingly admit an immediate 
extension in which G is replaced by H. Thus regardless of whether the ranks 
of G are well-ordered or not as a whole, the preceding theorems hold upon re- 
placing G by H, provided the ranks of H alone are well-ordered. The case 
where all the elements of H except the null element have rank will be met in 
the proof of the next theorem. 

Bi-levical ranks. Let s and t be variables which range over a simply ordered 











406 MARSTON MORSE 


set of elements (x). We suppose that the ranks p(u) of elements u of G are 
given by lexicographically ordered pairs (s, t). If an element wu of G has q 
rank p = (s, t) we shall write s = s(u) and ¢ = é(u) and term s and ¢ the first 
and second components of p respectively. We shall assume that p(u) exists 
if and only if s(w) exists, and that s(u) in turn exists if and only if t(u) exists, 
Ranks p = (s, t) satisfying these conditions will be termed bi-lezxical. 

The ranks r(u) of F-non-bounding k-cycles are bi-lexical. 

We shall prove the following theorem. 

TuHreorEM 5.5. When the ranks p = (s, t) of elements u of G are bi-lezical and 
well-ordered each maximal group of elements of G with rank is a direct sum of 
maximal groups of elements of G with the respective values of t, and arbitrary 
maximal groups of elements of G with the respective values of t sum to a maximal 
group of elements of G with rank. 

A maximal group g of elements of G with rank is a direct sum of maximal 
groups g(p) of elements of G with the respective ranks p, as we have seen in 
Theorem 5.3. Let h(t) be the direct sum of those groups g(p) for which the 
second component of p ist. Then g is a direct sum 


(5.4) g = > h(t). 
Moreover each group h(t’) must be a maximal group of elements of G for 
which t(u) = #’. For otherwise there would exist an element v such that 


t(v) = t’ and such that the direct sum of the group (v) generated by v and 
h(t’), is a group of elements u for which ¢(u) = ¢’. If wu is an element of the 


group, 
(5.5) H = > hit) (t # t) 


then t(u) ¥ t’, so that the ranks of elements of (v) + A(t’) and H are different. 
Hence we can form the direct sum 


K = (vl) +h) + A, 


obtaining thereby a group of elements with rank. Moreover K contains 9 
as a proper subgroup. From this contradiction we infer that the group h(t’) 
is a maximal group of elements of G for which t(u) = ¢t’, and the proof of the 
first affirmation in the theorem is complete. 

We shall now prove the second affirmation in the theorem. 

To that end let k(t) be an arbitrary maximal subgroup of elements u of G 
for which t(u) = t. Let g(s, é) be a maximal subgroup of elements v of k(t) 
for which s(v) = s. By virtue of the extension of the preceding theorems noted 
just before the present theorem, k(é) is a direct sum 


I say that g(s, ¢) is a maximal group of elements u of G for which p = (s, t). 
Otherwise we could reason as in the proof that A(t) was maximal and obtain 4 
group p(t) of elements » for which ¢(v) = t and which would contain k(¢) as @ 
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proper subgroup. Hence g(s, t) is a maximal group of elements of G with rank 
o = (s,t). It follows from Theorem 5.3 that the group 


» g(s,t) = x k(t) 


is a maximal group of elements of G with rank, and the proof of the theorem 
is complete. 

«Telescopic group sequences. Our remarks in the remainder of this section 
are largely by way of orientation. 

In earlier papers on critical points of functions the author has been concerned 
with integers nx = M; — R,, positive or zero, where M;, was the kt type 
number sum of the critical sets and R, was the k** connectivity of M. The 
numbers yu; , When finite, were found to be of the form 


(5.6) Me = Yr + YK (vy. 20;k = 0,1, ---) 


where y; is an integer and y_; = 0. 
We continue with the following lemma. 
Lemma 5.2. A necessary and sufficient condition that a sequence of non- 


negative integers uo , M1, *** be representable in the form (5.6) is that these integers 
satisfy the inequalities 
(5.7) Mn — Bn + — + (—1)"o 20 (n = 0, 1, oe), 


If the numbers yu; are representable in the form (5.6) we find that 
(5.8) Mn — Mn-1 + ay + (—1)"uo = Va 


so that the inequalities (5.7) are a consequence of (5.6). Conversely if the 
relations (5.7) are satisfied and we define y, by means of (5.8), we find that 
y: 2 0 and that (5.6) holds. 

The inequalities (5.7) were fundamental in the earlier theory and were the 
source of numerous existence theorems for critical points. 

The present more general theory is formally analogous to the earlier theory 
with groups m, replacing the numbers yu; and the group operation of reduction 
with respect to a modulus replacing the operation of subtraction in (5.7). In 
this connection it should be noted that (5.7) can be written in the form 


(5.9) Mn — [Mn 3 (un—2 ab Ho) aay ] = 0, 


employing subtractions only. 

More explicitly we shall be concerned with an infinite sequence of groups 
m,m, +--+. The function F will give rise to certain natural isomorphisms 
termed +-isomorphisms, which map a group g, onto a group denoted by gi. 
In terms of these «isomorphisms a sequence of groups m; will be termed 
+lelescopic if there exists an auxiliary sequence of groups gz such that 


(5.10) me = ge + g-1 (ga = 0;k = 0,1, ---). 
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The relations (5.10) are the formal analogues of the relations (5.6). In fact 
if uy, and yz are the dimensions of m, and g; respectively the relations (5.6) 
are a consequence of the relations (5.10). 

We can also give a formal analogue of the relations (5.9). To that end 
we read mod* A, “mod a group +-isomorphic with A.” We then replace (5.9) 
by the relations 


(5.11) m, mod* (m,_, mod* (mny_2 --- mod* (m) ---) 20, (n =0,1,---) 


reading 20 “exists.” In (5.11) as in (5.9) the operations indicated are to be 
successively performed starting with the innermost parenthesis. The symbolic 
expressions 


m, mod* mo , 


(5.12) m, mod* m, mod* my , 


oeeereee eee eer eeeee 


appear in infinitely many of the relations (5.11), but in each of these relations 
they shall indicate the same groups. 

It is easily seen that a necessary and sufficient condition that the groups m, 
satisfy relations of the form (5.11) is that there exist auxiliary groups g, such 
that the groups m, are of the form (5.10) for suitably defined +#isomorphisms 
These *-isomorphisms will presently receive a concrete exemplification. 


II. Naturau IsoMORPHISMS 


6. k-caps and a-isomorphisms. The object of chapter II is to establish 
certain general relations between groups of F-non-bounding 7-cycles and groups 
of j-caps. We shall make use of three basic types of isomorphisms, r-isomor- 
phisms between groups of F-non-bounding k-cycles, a-isomorphisms between 
groups of k-caps and *-isomorphisms between groups of F-non-bounding k-cycles 
and groups of (k + 1-caps. These isomorphisms depend in an essential way 
upon the nature of F. 

We first introduce several new concepts. 

k-caps. A k-cycle u on F., mod, F, will be termed a k-cap belonging to 
a if 

un 0 (on Fa, mod, F;). 


The constant a will then be termed a k-cap limit a(u) of wu. The k-cap limits 
a(u) satisfy the four rank axioms a(u) replacing p(u), provided the underlying 
group G of the preceding section be correctly defined. 

To define G we term a sequence w of algebraic k-chains w, with norms ¢ 
tending to zero a formal k-chain. To add two formal k-chains we add their 
components, while éw shall be the formal k-chain with components 6v, . Formal 
k-chains make up an additive abelian operator group with operators 6. The 
components of a formal k-chain w may in particular be the components of a 
k-cap z. In such a case it is easily seen that a(z) is uniquely determined by v. 
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To add two k-caps wu and v consider the formal k-chain w = u + v. If w 
defines a k-cap z we regard z as the cap sum of wu and v. If w defines no k-cap 
we understand that there is no cap sum u + v. In the group of formal k-chains 
G certain formal k-chains are thus k-caps and possess cap limits. Concerning 
those cap limits we have the following theorem. 

TueorEM 6.1. The k-cap limits a(u) satisfy the rank conditions, a(u) replacing 
p(u) with the underlying group G the group of formal k-chains. 

The proof of this theorem is immediate. 

We shall give certain definitions which concern k-caps. We shall term an 
homology 


u~0d0 (on, F.4 mod, F.) 


a c-homology. The earlier p-class and p-isomorphism are here replaced by their 
formal analogues a-class and a-isomorphism. It is seen that two k-caps with 
cap limits c belong to the same a-class if and only if their difference is c-homol- 
ogous to zero. 

Let p be a point on M and c a number such thatO Sc <1. We term (p, c) 
a limiting pair if c is a limit at p of values of F(p,) for some sequence p, tending 
to p. We include pairs of the form [p, F(p)] as limiting pairs. There are 
certain limiting pairs which are particularly distinguished and which will be 
called critical or semi-critical pairs. They will be defined in §9. Their exis- 
tence depends upon a series of theorems which we shall now develop. 

A k-cap which is a k-cycle will be termed a k-cycle-cap. The following 
theorem may be regarded as a first existence theorem in the theory of k-caps. 

THEOREM 6.2. Corresponding to an arbitrary F-non-bounding k-cycle u there 
is a k-cycle-cap z in the r-class of u such that the cap limit a(z) differs arbitrarily 
hitle from the inferior cycle limit t(u). 

If t(u) = 1, wis a cycle-cap and the theorem is true. We suppose then that 
tu) < 1. 

If the r-class of wu contains a cycle-cap belonging to t(u) the theorem is again 
true. If the r-class of u contains no such cycle-cap let 8 be an arbitrary constant 
less than 1 such that t(u) < B < s(u). 

There exists a k-cycle u’ in the given r-class below, 8 with u’ ~ 0 on F < 8. 
Let a be the inferior cycle limit of u’ on F < 8. It follows from Lemma 2.2 
that u’ ~ 0 mod F S aonF S B. Upon setting F.. = V and Fs, = C with 
U = 0 it follows from Theorem 3.1 that there exists a k-cycle z on V such that 
w’~zonC. But z is not a-homologous to zero for otherwise a would not be 
an inferior cycle limit of z on F < 8. Hence z is a cycle-cap belonging to a. 

The theorem follows directly. 

An F-non-bounding k-cycle u which lies on the domain F < ¢(u) will be 
termed canonical. We have seen that there is a cycle-cap in the r-class of 
each F-non-bounding k-cycle u, but there need be no canonical k-cycle in the 
r-class of u as the following example shows. 
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The relations (5.10) are the formal analogues of the relations (5.6). In fact 
if uw, and y, are the dimensions of m, and g, respectively the relations (5.6) 
are a consequence of the relations (5.10). 

We can also give a formal analogue of the relations (5.9). To that end 
we read mod* A, “mod a group *-isomorphic with A.” We then replace (5.9) 
by the relations 


(5.11) m, mod* (my. mod* (mp2 --+ mod* (m) +--+) 20, (n =0,1,---) 


reading 20 “exists.”’ In (5.11) as in (5.9) the operations indicated are to be 
successively performed starting with the innermost parenthesis. The symbolic 
expressions 


m, mod* m , 


(5.12) mz mod* m, mod* my , 


appear in infinitely many of the relations (5.11), but in each of these relations 
they shall indicate the same groups. 

It is easily seen that a necessary and sufficient condition that the groups m, 
satisfy relations of the form (5.11) is that there exist auxiliary groups g, such 
that the groups m, are of the form (5.10) for suitably defined *-isomorphisms 
These «isomorphisms will presently receive a concrete exemplification. 


II. Natura. IsoOMORPHISMS 


6. k-caps and a-isomorphisms. The object of chapter II is to establish 
certain general relations between groups of F-non-bounding 7-cycles and groups 
of j-caps. We shall make use of three basic types of isomorphisms, r-isomor- 
phisms between groups of F-non-bounding k-cycles, a-isomorphisms between 
groups of k-caps and *-isomorphisms between groups of F-non-bounding k-cycles 
and groups of (k + 1-caps. These isomorphisms depend in an essential way 
upon the nature of F. 

We first introduce several new concepts. 

k-caps. A k-cycle u on F., mod, F, will be termed a k-cap belonging to 
a if 

un0 (on F.4 mod, F,). 


The constant a will then be termed a k-cap limit a(u) of u. The k-cap limits 
a(u) satisfy the four rank axioms a(u) replacing p(u), provided the underlying 
group G of the preceding section be correctly defined. 

To define G we term a sequence w of algebraic k-chains w, with norms ¢ 
tending to zero a formal k-chain. To add two formal k-chains we add their 
components, while dw shall be the formal k-chain with components éw, . Formal 
k-chains make up an additive abelian operator group with operators 6. The 
components of a formal k-chain w may in particular be the components of a 
k-cap z. In such a case it is easily seen that a(z) is uniquely determined by wv. 
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To add two k-caps u and v consider the formal k-chain w = u + v. If w 
defines a k-cap z we regard z as the cap sum of u and v. If w defines no k-cap 
we understand that there is no cap sum u + v. In the group of formal k-chains 
G certain formal k-chains are thus k-caps and possess cap limits. Concerning 
those cap limits we have the following theorem. 

TurorEM 6.1. The k-cap limits a(u) satisfy the rank conditions, a(u) replacing 
p(u) with the underlying group G the group of formal k-chains. 

The proof of this theorem is immediate. 

We shall give certain definitions which concern k-caps. We shall term an 
homology 


u~0O (on, F.+ mod, F.) 


a c-homology. The earlier p-class and p-isomorphism are here replaced by their 
formal analogues a-class and a-isomorphism. It is seen that two k-caps with 
cap limits c belong to the same a-class if and only if their difference is c-homol- 
ogous to zero. 

Let p be a point on M and c a number such thatO0 Sc $1. We term (p, c) 
a limiting pair if c is a limit at p of values of F(p,) for some sequence p, tending 
to p. We include pairs of the form [p, F(p)] as limiting pairs. There are 
certain limiting pairs which are particularly distinguished and which will be 
called critical or semi-critical pairs. They will be defined in §9. Their exis- 
tence depends upon a series of theorems which we shall now develop. 

A k-cap which is a k-cycle will be termed a k-cycle-cap. The following 
theorem may be regarded as a first existence theorem in the theory of k-caps. 

THEOREM 6.2. Corresponding to an arbitrary F-non-bounding k-cycle u there 
is a k-cycle-cap z in the r-class of u such that the cap limit a(z) differs arbitrarily 
little from the inferior cycle limit t(u). 

If t((u) = 1, wis a cycle-cap and the theorem is true. We suppose then that 
tu) < 1. 

If the r-class of u contains. a cycle-cap belonging to t(u) the theorem is again 
true. If the r-class of u contains no such cycle-cap let 8 be an arbitrary constant 
less than 1 such that t(u) < B < s(u). 

There exists a k-cycle wu’ in the given r-class below. 8 with u’ ~ 0 on F S 8. 
Let a be the inferior cycle limit of u’ on F S B. It follows from Lemma 2.2 
that u’ ~Omod F S aonF Sg. Uponsetting F., = V and Fs, = C with 
U = 0 it follows from Theorem 3.1 that there exists a k-cycle z on V such that 
w'~zonC. But z is not e-homologous to zero for otherwise a would not be 
an inferior cycle limit of z on F S 8. Hence z is a cycle-cap belonging to a. 

The theorem follows directly. 

An F-non-bounding k-cycle u which lies on the domain F S t(u) will be 
termed canonical. We have seen that there is a cycle-cap in the r-class of 
each F-non-bounding k-cycle u, but there need be no canonical k-cycle in the 
class of u as the following example shows. 
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Example 6.1. Let the space M consist of the closure of the curve 
rnage 
y = (1 — 2) sin — (0<21) 


together with a straight line segment joining the point (0, 1) to the point 
(—1, —1) in the (z,y) plane. On M let F = (y + 1)/2 80 that 0 < F <1, 
Let u be a 0-cycle mod 2 whose components u consist of the two points (—1, —1) 
and (1,0). Wesee that t(u) = 0. There is no 0-cycle on the domain F = 0 
in the r-class of wu. 

Example 6.2. We can modify the preceding example to show that an in- 
ferior cycle limit may equal no cap limit. To that end we join the bottom 
points (—1, —1), (—1, 0) of the preceding point set M by a semi-circle on the 
domain y < —1 and let the resulting point set be denoted by M’. On M’ let 
F = y waiving the conditions thatO < F < 1. The 0-cycle u of the preceding 
example now has an inferior cycle limit which equals no cap limit, since the 
semi-circle can be deformed on itself away from its end points. 


7. «-Isomorphisms. If v is a k-cap such that Bv ~ 0 below, its cap limit 
a(v), v will be termed linkable. A non-linkable k-cap v will be said to cap its 
boundary u = #v and u will then be said to admit the cap v. A group of F-non- 
bounding k-cycles will be said to be *-isomorphic with a group gx of (k + 1)- 
caps if g, and 9h admit an isomorphism in which each k-cycle of g; is capped by 
its correspondent in gp. 

We shall make use of the following lemma. 

Lemma 7.1. If u is a k-cycle mod, F. on F S a and is a-homologous to zero, 
then Bu ~ 0 below, a. 

Under the hypotheses of the lemma we can affirm the existence of a positive 
constant e with the following property. Corresponding to the n‘ component 
Un Of u there exists an algebraic k-chain w, on F S a — e and an algebraic (k + 1)- 
chain z, on F S asuch that 6z, = un + wy, where Up , Wn, and Z, possess norms 
e, which tend to zero as n becomes infinite. Hence Bu, + Bw, = 0 and Bun ~e, 9 
onF Sa-—e. 

The proof of the lemma is complete. 

An F-non-bounding k-cycle uw which admits a k-cap v is bounding. A k-cycle 
which is both F-non-bounding and bounding has been termed ambiguous. With 
this understood we state the following theorem. 

THEOREM 7.1. An ambiguous k-cycle u admits at least one (k + 1)-cap », 
with a(v) = s(u). 

Let s(u) be denoted by a. If e is an arbitrary positive constant then u ~ 0 
below, a + e by virtue of the definition of s(u). If a < 1 it follows from 
Lemma 2.2 that u ~~ 0on F S a. This homology is trivial if a = 1. 

We shall apply Theorem 3.2. To that end we set A = 0, and let V bea 
carrier of the homology wu ~ 0 on F < a. The cycle u has a carrier U on V 
below, a. It follows from Theorem 3.2 that there exists a (k + 1)-cy¢ le 
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ymod U on V such that 6v = u. Moreover v is not a-homologous to zero for 
otherwise Bv ~ 0 below, a by virtue of Lemma 7.1, and we could infer that 
u ~ 0 below, a contrary to the nature of a = s(u). 

Thus v caps u, and the proof is complete. 

Let g. be a group of ambiguous k-cycles representable as a direct sum 


(7.1) ge = > gu(c) 


of subgroups gi(c) of k-cycles u for which s(u) = c, where c ranges over an 
arbitrary set of superior cycle limits s. We term gx s-neuclear, and state the 
following theorem. 

TuEorEM 7.2. An s-neuclear group gx of ambiguous k-cycles is *-isomorphic 
with at least one group gx of (k + 1)-caps. 

Let (z)- be a maximal linear set in g,(c) and let (z) be a set theoretic sum of 
the sets (z).. The set (z) will form a maximal linear set for g,. With each 
cycle z; in (2) we now associate a (k + 1)-cap K(z;), as is possible by virtue of 
Theorem 7.1. Weset K(0) = 0. Leta, --- , 2, be a set of distinct cycles in 
(z). Ifu = 6:2; we set 

K(u) = 6: K(z), 
and prove statements (a) and (§). 
(a). The cycle u is capped by K(u) if u # 0. 
In proving (a) we can assume without loss of generality that no 6; is null. 


Set a = s(u). Recalling that a = max s(z;) we see that each K(z;) lies on 
F s a, and hence K(u) lies on this same domain. Moreover 


BK(u) = d;z¢ = u. 


Finally K(u) is not a-homologous to zero, for otherwise u ~ 0 below, a by 
virtue of Lemma 7.1. 
The proof of (a) is complete. 
(8). The (k + 1)-caps K(u) form a group g; isomorphic with the group gi . 
Let x and y be two cycles of g; of the form 


x = 6:2, y = 6:2, 
where the z,’s form a set of distinct cycles of (z). We see that 
K(x + y) = (6; + 6;) K(z:) = K(x) + K(y). 


Hence the caps K(u) define a group homeomorphic with g,. That this homeo- 
morphism is simple follows from the fact that K(u) = 0 only if wu = 0, in ac- 
cordance with (a). 

The proof of the theorem is complete. 


8. Well-ordered cap limits. The hypothesis that the cap limits are well- 
ordered has important consequences which will be studied in this section. 
THEorEM 8.1. Let u be an F-non-bounding k-cycle. If the k-cap limits 
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greater than t(u) are bounded away from t(u) there is a canonical k-cycle in the 
r-class of u. 

By virtue of Theorem 6.2 there exists a k-cycle-cap z in the r-class of u such 
that a(z) differs arbitrarily little from ¢(u). Under the hypotheses of the present 
theorem a(z) = ¢(u) if a(z) — t(u) is sufficiently small. But if a(z) = t(u), 
z will be a canonical k-cycle in the r-class of u and the proof of the theorem 
is complete. 

Corotuary. If the k- and (k + 1)-cap limits are well-ordered each k-cycle 
limit is a k- or (k + 1)-cap limit and the ranks of F-non-bounding k-cycles are 
well-ordered. 

That each superior cycle limit of a k-cycle is a (k + 1)-cap limit follows from 
Theorem 7.1, while inferior cycle limits of k-cycles are k-cap limits by virtue 
of the preceding theorem. 

The ranks r = (s, #) of F-non-bounding k-cycles can be grouped in classes 
with common s, and these classes will be well-ordered among themselves by 
virtue of the well-ordering of the (k + 1)-cap limits s. The ranks r in each 
such class can now be further ordered according to the values of ¢. Each class 
will then be well-ordered. The ranks thus receive their prescribed lexico- 
graphic order and it is readily seen that they are thereby well-ordered. 

Let u be a k-cycle mod, H where H is a domain F, or F.;. A constant 
b > awill be called a homology limit of u mod, H if wis on, F, and u ~ 0 below, 
b mod, H. A k-cap u with cap limit a will be termed degenerate if u possesses 
no homology limit mod, F.. That degenerate k-caps exist is shown by the 
following example. 

Example 8.1. Let the space M consist of the points (x, y) on the curve 
y = xsin 1/z for 0 S x S 1 and let F = (y + 1)/2 on M. Let u, bean 
algebraic 0-cycle mod 2 which coincides with the point (0, 0). The Vietoris 
0-cycle u = (un) is a degenerate 0-cap. 

We continue with the following theorem. 

THEOREM 8.2. Let u be a k-cap with cap limit a. If the (k + 1)-cap limits 
greater than a are bounded away from a, u is non-degenerate. 

The theorem is trivial when a = 1. 

Suppose a < 1. Let b be a lower bound between 1 and a of the (& + 1)-cap 
limits greater than a. We shall prove the following statement. 

(i). The constant b is a homology limit of u mod, Fa. 

Suppose (i) false. Then for some constant c between a and b and some 
constant a <a,u~OonF ScmodF Sa. Upon setting 


(8.1) A = Fa, U = Fu, V = Fe, 


it follows from Theorem 3.2 that there exists a (k + 1)-cycle v mod U on V 
such that Bv = u mod A. Moreover 


(8.2) vx 0 (mod Fa+ on Fes); 


for otherwise 6v ~ 0 on U, and hence u ~ 0 mod A on U contrary to the nature 
of u as a k-cap. 
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From (8.2) it follows that there is a greatest lower bound ¢ of numbers e > a 
such that v ~ O mod F S eon F Sc. Upon referring to Lemma 2.2, setting 
B= F., and C = F,, we see that 


(8.3) v~0 (mod F,, on F.,). 
It follows from (8.2) and (8.3) that? > a. We next apply Theorem 3.1, setting 
U = Fa, V= Fi, C= Fe, 


and infer from (8.3) that there exists a (k + 1)-cyclez mod F S aonF St 
homologous to v mod F S aon F Sc. It follows from the choice of ¢ that v 
and hence z ~ 0 mod, F; on F,,. Since z is on F S tf, ¢ is a cap limit of z 
with a < t < b, contrary to the choice of b. 

The proof of (i), and of the theorem is complete. 

We shall prove the following theorem. 

TueorEM 8.3. (1). There is a k-cycle-cap v in the a-class of each linkable 
k-cap u. (2). If wu ts non-degenerate v is F-non-bounding. (3). If the k- and 
(k + 1)-cap limits are well-ordered v may be chosen so as to be canonical. 

We suppose that a is the cap limit of u and let B be a carrier of uon F S a 
Since u is linkable there exists a constant a < a such that Bu ~ OonF S a. 
If we denote the latter domain by A, set V = A + B, and apply Theorem 3.3, 
statement (1) of Theorem 8.3 follows directly. 

If u is non-degenerate v is likewise. The cycle v thus possesses a homology 
limit mod, F, and a fortiori a homology limit so that (2) is true. 

We now turn to the proof of (8). 

If the (k + 1)-cap limits are well-ordered, the (k + 1)-cap limits greater 
than a are in particular bounded away from a so that u is non-degenerate in 
accordance with Theorem 8.2. As stated in (2), v is then F-non-bounding. 
If the k- and (k + 1)-cap limits are well-ordered, the ranks of F-non-bounding 
k-cycles are well-ordered, and by virtue of Theorem 8.1 there is a canonical 
k-cycle in each r-class of k-cycles. A maximal group g(r) of k-cycles with rank r 
can accordingly be made up of canonical k-cycles. If h(é) is the sum of the 
groups g(r) for which the second component of r is t, >. A(t) will be a maximal 
group of k-cycles with rank. Each non-null cycle z in a group A(t) will lie 
on F S ¢ and have ¢ as an inferior cycle limit. Hence each such cycle z will 
be canonical. 

There accordingly exists a set , *** , 0, Of canonical k-cycles from different 
groups h(t) such that the k-cycle w = v — (wv. + --- + vx) is without rank. 
The cycle w is not a k-cap, for otherwise w would be a re egg k-cap 
and hence have rank. Hence v is in the a-class of (v1 +---+v,) =z. With- 
out loss of generality we can suppose that t(v:) > --- > t(0,). The cycle-cap z 
is not necessarily canonical, but it is in the sie of v;, and » is canonical. 
Thus u is in the a-class of the canonical cycle v, , and the proof of the theorem 
is complete. 


An a-class of k-caps will be said to be of excess type if it contains no cycle 
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or contains an ambiguous k-cycle. Otherwise the a-class will contain a nop- 
bounding k-cycle but no ambiguous k-cycle and be said to be of space type. 
A k-cap will be said to be of excess or space type according as it is in an a-class 
of excess or space type. A linkable k-cap may be either an excess or space cap, 
A non-linkable k-cap is always an excess cap. An excess k-cap and a space 
k-cap are never in the same a-class, and accordingly their difference is always 
a k-cap. 

We shall now prove four theorems on the composition of maximal groups of 
k-caps. 

Tueorem 8.4. If the k-cap limits are well-ordered, a maximal group M, of 
k-caps is a direct sum M, = M;, + M;, of arbitrary maximal subgroups respec- 
tively of excess and space k-caps of M;. The groups M;, and Mx are marimal 
among k-caps in general. Conversely arbitrary maximal groups of excess and 
space k-caps sum to a maximal group of k-caps. 

We shall begin by proving the following statement. 

(a). A maximal group ux of k-caps with a cap limit c is a direct sum 


/ ” 
Me = Me 7 Mk 


of maximal groups yu, and ue of excess and space k-caps respectively with cap 
limit c. 

The subset of linkable k-caps of yu, together with the null element form a 
group a, so that yu, is a direct sum uw, = ag + c;, in which c; is a maximal sub- 
group of non-linkable k-caps of u,. The subset of excess k-caps of a; together 
with the null element form a group a; so that a, is a direct sum a, + 6; in 
which 6; is a maximal subgroup of space k-caps of a,. Hence yx is a direct 
sum a. + Be + c,. We set ui = ox +c and uy, = &. That pu; is a group 
of excess k-caps with cap limit c follows from the fact that a non-linkable and 
a linkable k-cap with cap limits c¢ sum to a non-linkable k-cap with cap limit c. 
That pz and yp, are respectively maximal groups of excess and space k-caps 
with cap limit ¢ follows from the relation u, = us + wr, the fact that ux is 
maximal, and the fact that an excess and a space k-cap with cap limit ¢ sum 
to a k-cap with cap limit c. 

The proof of (a) is complete. 

We can now establish the first two statements in the theorem. 

Since the cap limits satisfy the rank conditions it follows from Theorem 5.2 
that a maximal group M;, of k-caps is a direct sum of groups u:(c) which are 
maximal groups of k- “caps with the respective cap limits c. Each such group 
ux(c) is a sum yz(c) + ux (c) of the nature described in (a). Upon setting 


mi, = »» ur(c), my = Mk (ce) 


c 


we see that M, = m, + m:. The groups m; and m;, are respectively groups 
of excess ,and space k-caps. That they are maximal follows from the relation 
M,. = m, + m; and the fact that an excess and a space k-cap always sum to 
a k-cap. 
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The proof of the first two statements in the theorems is complete. 
o , 
To prove the final statement in the theorem we assume that M; and M; 
are respectively maximal groups of excess and space k-caps. The sum 


(8.4) L, = Mi + My 


is composed of k-caps. Let M; be a maximal group of k-caps such that M; D L,. 
’ ! ; 3 / 

The groups M;, and M; are maximal among formal k-chains and hence in M, . 

It follows from the first statement in our theorem that 


M, = Mi + M;. 


The proof of the theorem is complete. 

TuroreM 8.5. If the k-cap limits are well-ordered for each k, 

(i) there exists a maximal group g of ambiguous k-cycle-caps which is a maximal 
group of ambiguous k-cycles, 

(ii) such a group g, admits a *-isomorphic group of (k + 1)-caps 9k 

(iii) while the groups gx and es admit a direct sum 


(8.5) me = ge + git (k =0,1,--:) 


which is a maximal group of excess k-caps. 

We shall prove statement (i) with the aid of Theorem 5.5. 

To that end we shall introduce a new set of ranks o, assigning ranks o only 
to ambiguous k-cycles u and for such cycles setting’ o(u) = r(u). The ranks o 
clearly satisfy rank conditions I, II, III. 

There is a canonical k-cycle in each r-class by virtue of Theorem 8.1. It 
follows that there exists a maximal group k(t) of ambiguous k-cycles with a 
prescribed inferior cycle limit ¢ with the property that the non-null cycles of 
k(t) are canonical. We now apply Theorem 5.5 employing ranks c. It follows 
that Dk(t) is a maximal group g of ambiguous k-cycles. But each cycle of 
k(t) is a eycle-cap with cap limit t. The group g is thus a group of cycle-caps. 
Since g is maximal as a group of ambiguous k-cycles it is a fortiori maximal as 
a group of ambiguous k-cycle-caps. 

The proof of statement (i) is complete. 

Statement (ii) is a consequence of Theorem 7.2. 

We continue with a proof of the following statement. 

(a). The groups g, and nll admit a direct sum m; which is a group of excess 
k-caps. 

Let uw and v be k-caps in g, and gz-1 respectively. We seek to prove that 
u + vis an excess k-cap, If a(u) > a(v), uw + v is a k-cap in the a-class of wu, 
and since u is of excess type u + vis of excess type. “The case where a(v) > a(u) 
ls similar. If a(u) = a(v), uw + v is non-linkable and hence of excess type. 
Thus (a) is true. 

(8). If w is an arbitrary non-linkable k-cap there is an element w in gi. such 
that u — w is not a non-linkable k-cap with cap limit a(u). 

Let Bu = v. The cycle v is ambiguous so that there exists a cycle z in gia 
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such that v — z has no rank. Let w be the image of z in gz. Since » — : 
has no rank v — z ~ 0 below, a(u). Hence 


(8.6) Blu —- w) =v—z~0 [below, a(u)]. 


If a(u — w) = a(u), uw — wis linkable by virtue of (8.6) and (8) is true. If 
a(u — w) does not exist or is less than a(u) we see that (8) is again true. 

(y). If wis an excess linkable k-cap there is an element w in gi, such that u — w 
is not a k-cap with cap limit a(u). 

Let v be an ambiguous k-cycle in the a-class of u. There exists a k-cycle w 
in g, such that v — w is not an ambiguous k-cycle and hence not a k-eap. Thus 
v, w, and wu are in the same a-class. Since wu and w are in the same a-class, 
a(u — w) does not exist or is less than a(u), and (7) is true. 

(6). If u is an excess k-cap there is an element w in m, such that u — w is not 
an excess k-cap with cap limit a(u). 

In case wu is linkable (6) follows from (7). 

We accordingly assume that u is non-linkable. According to (8) there 
exists a k-cap w in gi such that w — w is not a non-linkable k-cap with cap 
limit a(u). The difference wu — w may satisfy (6). Otherwise a(u — w) = a(u) 
and u — w is an excess linkable k-cap to which (y) applies. Hence (6) holds 
in all cases. 

We can now prove (iii). . That m, is a group of excess k-caps has been proved 
in (a). It remains to show that m, is maximal. To that end let wu be an 
arbitrary excess k-cap and w the corresponding k-cap in (6). If wu — w is not 
an excess k-cap (iii) is true. If w— w is an excess k-cap a(u — w) < a(u) in 
accordance with (6). Set w = wi. We now treat wu — w, as we did wu. Pro- 


- ceeding inductively and recalling that a decreasing sequence of k-cap limits 


is finite we conclude that there exists a set w;, We, +++ Wm of k-caps in m, with 
decreasing cap limits such that wu — (w, + --- + w,») is not an excess k-cap. 

The proof of (iii) and of the theorem is complete. 

Upon referring to Lemma 5.2 we have the following corollary. 

Corotuary. The dimension yp; of the maximal groups m, of excess k-caps, 
of finite, satisfy the relations (5.7). More generally the growps my, are *-telescopic 
and satisfy relations of the form (5.11). 

We come finally to maximal groups of space k-caps. We shall prove the 
following theorem. 

THEOREM 8.6. If the k- and (k + 1)-cap limits are well-ordered each maximal 
group of canonical non-bounding k-cycles is 

(1) a maximal group of nen-bounding k-cycles, 

(2) a maximal group of space k-caps. 

We shall first prove statement (1) of the theorem. 

To that end we assign ranks ¢ to canonical non-bounding k-cycles setting 
o(u) = r(u). For such cycles s(u) = +0. The rank conditions I, II, II are 
satisfied by ranks o. 
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Let g(c) be a maximal group of canonical non-bounding k-cycles with rank o. 
It follows from Theorems 5.2 and 5.3 that the sum 


(8.7) g = > g(c) 


‘; a maximal group of canonical non-bounding k-cycles and each maximal 
group of canonical non-bounding k-cycles is of the form (8.7). But there is a 
canonical k-cycle in each r-class of k-cycles. It follows that g(c) is a maximal 
group of non-bounding k-cycles with rank r = o. Moreover the ranks r of 
non-bounding k-cycles satisfy the rank conditions I, II, III as is readily seen. 
Hence the preceding group g is a maximal group of non-bounding k-cycles, and 
statement (1) of the theorem is proved. 

We continue with the following lemma. 

Lemma 8.1. A necessary and sufficient condition that a linkable k-cap u be 
of space type is that on M 


(8.8) ux~0 [mod, F < a(u)]. 


Any k-cycle in the a-class of a space k-cap is canonical and non-bounding, and 
conversely any canonical non-bounding k-cycle is a space cap. 

Suppose u is a space k-cap. Let v be a k-cycle in the a-class of u. If (8.8) 
were false v ~ 0 mod, F < a(u). There would then exist a k-cycle w below, a(u) 
such that v — w ~ 0. But v — w would be in the a-class of v and hence of u 
contrary to the hypothesis that wu is a space k-cap. Hence the condition (8.8) 
is necessary. 

To prove the condition (8.8) sufficient we again let v be a k-cycle in the a-class 
of u. It follows from (8.8) that 


(8.9) vx 0 [mod, F < a(u)] 


and a fortiori that v is non-bounding. Thus wu is a space cap. The condition 
(8.8) is accordingly sufficient. 

The second statement in the lemma follows at once from the first. 

To establish (2) in Theorem 8.6 we assign ranks b(w) to space k-caps u setting 
O(u) = a(u). It is clear that the ranks b(u) satisfy the first three rank condi- 
tions, 

We shall prove that the group g in (8.7) is a maximal group of space k-caps. 

Observe that g(c) in (8.7) is a group of space k-caps with cap limit ¢ where 
o = (s,t) = (@, #). The group g(c) is a maximal group of space k-caps with 
cap limit ¢; for there is a canonical non-bounding k-cycle in the a-class of each 
space k-cap u. Moreover the cap limits of space k-caps satisfy the first three 
tank conditions so that the groups g(¢) must sum to a maximal group of space 
k-caps as stated. 

The proof of the theorem is complete. 
om three preceding theorems are combined and extended in the following 

eorem. 


Turorem 8.7, J f the k-cap limits are well-ordered for all integers k, an arbt- 
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trary maximal group M; of k-caps is a-isomorphic with a maximal group of k-caps 


of the form 
(8.11) ge + i + Q: 


where g, is a suitably chosen maximal group of ambiguous k-cycles, gi-1 is a group 
of k-caps *-isomorphic with gx, and Q;, 1s an arbitrary maximal group of canonical 
non-bounding k-cycles. The group Q; is isomorphic with the k** homology group. 
When the dimensions p, and Ri, of Mi and Q;, respectively are finite the following 
relations hold 


Po > Ro, 
(8.12) 1 — pp 2R — Ro, 
po — Pi + Po =R.—h + Ro, 


There is a group L; of the form (8.11) which is a maximal group of k-caps 
by virtue of the three preceding theorems. It follows from Theorem 5.4 that 
L, is a-isomorphic with M;,. The group Q; is a maximal group of non-bounding 
k-cycles in accordance with Theorem 8.6 and so is isomorphic with the k* 
homology group. If; is the dimension of g, 


Pe — Re = VR + Ve-1 (yi = 0, k = 0,1, ---) 
provided these dimensions are finite. The inequalities (8.12) follow at once. 


III. Crrricau Points 


9. Homotopic critical points. As stated in the introduction we distinguish 
between differential, homotopic, combinatorial, and essential critical points. 
Of these types of points the last three are topological. To define homotopic 
critical points we shall admit deformations of the following type. 

Deformations. Let E be a subset of M. We shall consider deformations D 
of E which replace a point p initially on E at the time t = 0 by a point 


q = q(p, t) (pC E;0 St $7) 


on M at the time ¢, where ¢ varies on the closed interval (0, 7). We shall sup- 
pose that 7 is a positive constant and that q(p, ¢) is a continuous point function 
of its arguments when ¢ is on (0, 7) and p is restricted to any compact subset 
of E. The curve gq = q(p, t) obtained by holding p fast and varying t on (0, 7) 
will be termed the trajectory T defined by p. If a point q precedes a point r 
on a trajectory 7’, q will be termed an antecedent of r. 

We shall say that a deformation D of a set S admits a displacement function 
6(e) on S if whenever gq is an antecedent of r such that gr > e > 0 then F(q) - 
F(r) > 6(e) where 6(e) is a positive single-valued function of e. 

Consider the special case in which F is continuous, S compact, and D is 
such that F(q) > F(r) whenever q is an antecedent of r distinct from r. In 
this case D always admits a displacement function 6(e). If however F is merely 
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lower semi-continuous more stringent conditions are needed to establish the 
existence of a displacement function, as the following example shows. 

Example 9.1. Let the space M consist of the points (x, y) in the square 
0<2<1,0< y S 1. On this square let F = —zy except when y = 0. 
When y = 0 let F = —a. The function F is lower semi-continuous on M. 
Let E be the points of M on the y axis and let D be a deformation in which 
each point of E moves parallel to the x axis at a unit velocity for a unit of time. 
The deformation D is such that F(q) > F(r) whenever q is an antecedent of r 
distinct from r. Nevertheless the deformation D admits no displacement 
function on E. 

F-deformations. A deformation D of E which admits a displacement function 
on each compact subset of E will be termed an F-deformation of E, and E will 
be said to be F-deformable under D. We shall say that E is F-deformable 
definitely onto F, (written onto, F.) if E is F-deformable onto a point set on 
F < c — e where ¢ is a positive constant. 

In the special case where F is continuous and M compact, a point p at which 
F(p) = e will be said to be homotopically ordinary if there exists a neighborhood 
of pon F < ¢ which is F-deformable onto, F.. In the general case where F 
is lower semi-continuous and M not necessarily compact the definitions are 
more complex. The preceding definition is a special case of a definition which 
we shall now give. 

A limiting pair (p, c) (see §6) will be termed homotopically ordinary if there 
exists a neighborhood of p on F S c which is F-deformable onto, F,. A limiting 
pair (p, c) which is not homotopically ordinary will be termed homotopically 
critical, and p will then be termed semi-critical at the level c. A point p which 
is semi-critical at the level F(p) will be termed critical. 

A point p may be semi-critical at different levels c. For example the point 
(0, 1) in Example 1.1 is semi-critical at the levels c = 0 andc = 1. When F 
is continuous a semi-critical point p is a critical point. We shall presently 
apply our theory to the calculus of variations and shall thereby be concerned 
with a function F which is lower semi-continuous but not in general continuous. 
The most important special case in this variational theory is the locally convex 
case and in this case it will appear that each semi-critical point p is a critical 
point p. 

Differential critical points. Let F be a function F(a, --- , tn) of class C” 
in an open region R of the space (u). We term a point (x) = (a) differentially 
ertical if all the first partial derivatives of F vanish at (a). Otherwise we 
term (a) differentially ordinary. 

We shall show that a point which is differentially ordinary is homotopically 
ordinary. To that end let 


Xi = x(a, -°--, Qn, t) = 2;(a, t) (@ = 1,2, ---,n) 
be the trajectory defined by the differential equation 


dz; 


_ 
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with the initial conditions z,(a, 0) = a;. Upon replacing the point (a) by 
the point [z(a, t)] we obtain an F-deformation of the points (a) neighboring 
any ordinary point (a°) of R, provided t be restricted to a sufficiently small 
interval (0, r). With the aid of this F-deformation it is seen that each point 
of R which is differentially ordinary is homotopically ordinary. Hence 
homotopic critical point of F(x) is a differential critical point. The converse 
is not true as the differential critical point = 0 of the function 2° proves, 

The preceding section has affirmed the existence of cycle-caps. The existence 
of cycle-caps implies the existence of semi-critical points in accordance with the 
following fundamental theorem. 

THEorEM 9.1. Corresponding to each cycle-cap u there is at least one homotopic 
semi-critical point p at the level a(u) on each carrier of u. 

We shall prove this theorem with the aid of two lemmas. The first lemma 
is as follows. 

Lemma 9.1. Let D be an F-deformation of an e-neighborhood p, of a point p. 
Let A be a compact set. There exists an F-deformation D’ of A which is identical 
with D for points initially on pejs . 

The deformation D’ of the lemma can be defined as follows. Suppose that 
the given deformation D is defined for a time interval (0, 7). For the deforma- 
tion D’ the time ¢ shall vary on the same interval (0, 7). Points of A initially 
on p-/3 Shall be deformed in D’ as in D, while points of A not on pe./3 shall be 
held fast during D’. For points q of A for which 
(9.1) S<@s* 
we define D’ as follows. Let t¢, divide the time interval (0, 7) in the ratio 
inverse to that in which qp divides the interval (e/3, 2e/3). In the deformation 
D’, points q of A which satisfy (9.1) initially shall be deformed as in D until 
the time ¢ reaches ¢, and shall be held fast thereafter. It follows that D’ deforms 
points initially on A continuously. Further A admits a displacement function 
under D’, namely the displacement function belonging to the intersection of 
A with pe.3 under the deformation D. The deformation D’ is thus an F- 
deformation of A, and the proof of the lemma is complete. 

Our second lemma is as follows. 

Lemma 9.2. If A is a compact set on F < a which contains no semi-critical 
points at the level a there exists an F-deformation A of A onto, F.. 

If there are no points on A in limiting pairs of the form (p, a) the set A lies 
on, Ff, and we can take A as the identity. 

If (p, a) is a limiting pair with p on A, (p, a) is homotopically ordinary by 
hypothesis. There accordingly exists an F-deformation D, of a spherical 
neighborhood V, of pon F < a onto F < a — e, where é, is a positive constant. 
Let U, and W, be respectively spherical neighborhoods of p with radii one-third 
and one-sixth that of V,. The set of points p on A in limiting pairs (?, a) 
is closed and hence compact. There accordingly exists a finite set of the neigh- 
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horhoods W, which cover x. Let W,;,7 = 1, +--+, n, be aset of such neighbor- 
hoods. 

Let B; be an F-deformation of A which deforms the points of A initially 
on U,, asin D,,. Such a deformation exists by virtue of the preceding lemma. 
let G, = A. Proceeding inductively with 7 > 1 let G; be the final image of 
G;, under B;-1 and let B; be an F-deformation of G; which deforms the points 
of G; initially on Up, as in D,;. Such a deformation exists by virtue of the 


preceding lemma. 
We introduce the product deformation 


(9.2) A =B,--: Bi, 


applying this deformation to points of A. We understand thereby that a 
point q of A is first deformed under B, into a final image B,g. The point Byq 
is then deformed under By, into a final image B,B,q. The deformation A is 
the result of the application of the deformations B,, --- , B, in this order. 
It is clear that A deforms points of A continuously. 

We shall show that A is an F-deformation. To that end let 6;(e) be the 
displacement function for G; under B;. Let q be an antecedent of r under A 
ona trajectory \ whose initial point is p. If gr > e at least one of the deforma- 
tions B; must have displaced an image of p on dA between gq and 7, a distance 
greater than e/n. Hence 


(9.3) F(q) — F(r) > min 0; (‘) (i =1,---,n). 


The deformation A thus admits a displacement function. We denote this 
displacement function by 6(e). 

Let p be the minimum of the radii of the neighborhoods W,; and let 8 be 
the minimum of the numbers ey; and 6(p). We continue with a proof of the 
following statement. 

(i). The set A is F-deformed under A onto. Fa. 

It will be sufficient to show that the final images on W,, of points of A lie 
oF Sa— 8. To that end we need only prove that the final images on W,, 
of points of G; under B, --- B; = A; lie on F S$ a — B. Points of G; will be 
divided into two classes. | 

Class I. Points on Up; . 

Class II. Points not on U me 

Points of G; in Class I are deformed onto F < a — e, S a — 6 under B; 
and hence under A;. Points of G; in Class II whose final images ¢ under A; 
are on W,, are displaced under A; a distance at least p so that F is thereby 
decreased at least 6(p). These points ¢ thus lie on F < a — 8, and the proof 
of statement (i) is complete. 

Lemma 9.2 follows from statement (i) and the theorem follows from Lemma 9.2. 

Spaces locally F-connected. We shall give a brief description of a special 
hypothesis of interest. The space M will be said to be locally F-connected 








422 MARSTON MORSE 


for the order n if corresponding to n, an arbitrary point p on M, and an arbi- 
trary positive constant e, there exists a positive constant 6 with the following 
property. Forc = F(p) any singular n-sphere on F' S c (the continuous image 
on F < c of an ordinary n-sphere) on the é-neighborhood p; of p is the boundary 
of a singular (n + 1)-cell on F S c + e and on p,. The hypothesis of local 
F-connectedness is not implied by the ordinary form of local connectedness at 
least if F is merely lower semi-continuous. 

In the locally convex case of our abstract variational theory we shall be 
dealing with a space 2 and function F defined on © such that @ is locally F-con- 
nected. In this application the values F = 1 will correspond to curves of 
infinite length. 

Let c be any constant less than 1 and d be any constant greater than c. Let 
R,(c, d) denote the number of k-cycles in a maximal linear set of k-cycles on 
F < c not homologous to zero on F S d. We state the following theorem. 

THEOREM 9.2. If the space M is locally F-connected for orders at most k + 1, 
the numbers R,(c, d) are finite for c < 1 and the inferior cycle limits of non-bound- 
ing k-cycles cluster at most at c = 1. 

The proof of the first statement of the theorem while not difficult involves 
too great detail to be presented here. The essential ideas are those in analogous 
proofs in Lefschetz, ref. 3. The inferior cycle limits of non-bounding k-cycles 
can have no cluster value b < 1. Otherwise corresponding to any constant c 
such that b < c < 1 there would exist an infinite sequence u” of non-bounding 
k-cycles on F S c with distinct inferior cycle limits less than c. The ranks of 
the cycles u” would be different so that the cycles u" would generate a group 
of non-bounding k-cycles of infinite dimensions. It would follow that R;(c, 1) 
would be infinite contrary to the first statement in the theorem. 

The proof of the theorem is complete. 


10. Homotopic critical sets. We are concerned in this section with homotopic 
critical points. With this understood the word homotopic will ordinarily be 
omitted. 

By a complete critical set w at the level c is meant the set of all semi-critical 
points at the level c. It follows from the definition of a semi-critical point 
that w is closed. The set w is compact if c < 1 since the domain F S c is then 
compact and w is a closed subset of this domain. Recall that the components 
of a point set A are the maximal connected subsets of A and are closed in A. 
The components of A are disjunct and sum to A. We shall term any sum 
of components of w a critical set o at the level c. 

If o is at a positive distance from w — a, o will be termed separate. A neigh- 
borhood of « which is at a positive distance from w — o will also be termed 
separate. 

Let R be a subset of M. If we regard R as a space M, waiving the condition 
that the subspace F < ¢ of R be compact when c < 1, a k-cap on F has a new 
meaning dependent on R and will be specially designated as a k-cap rel R. 
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The following theorem is useful. 

TueoreM 10.1. Let o be a critical set at the level c with a separate neighbor- 
hood U. 

(a). If wis a k-cap with cap limit c rel U, u is a k-cap rel M. 

(b). The k-cap wu is c-homologous on U to a k-cap v on an arbitrarily small 
neighborhood of o. 

We shall prove Theorem 10.1 with the aid of two lemmas, the first of which 
follows. In these lemmas the e- or y-neighborhood of a point set X will be 
denoted by X, or X, respectively. 

Lemma 10.1. (i). Let U be a separate neighborhood of a critical set o at the 
level c. Let b be a constant less than c, and n an arbitrary positive constant. A 
k-cap u with cap limit c and with carrier x on U + F; ts c-homologous on x to a 


k-cap on U,. 
(ii). Let the complete critical set w at the level c be the sum of critical sets 
c(i = 1, «+ , n) with neighborhoods U" at positive distances from one another. 


Let u' be a k-cycle mod, F < conF S cand U', such that the sumu' + --- +u"=u 
is c-homologous to 0 on Fy, + ZU ‘Then u' is c-homologous to 0 on U * for each i. 

We shall first prove (i). 

Let u, be the p** component of u, and H, the relative algebraic homology 
connecting u, and Up41 in the definition of wu. Of the cells on the complement 
of U we drop each k-cell of wp, and each (k + 1)-cell in H,. Let u, and Hj, 
be the k-chain and relative homology which thereby replace u, and H, respec- 
tively. The sequence u, together with the homologies H’, defines a k-cap wu’, 
c-homologous to u on x. If m is a sufficiently large integer the subsequence 
u, for which p = m together with the corresponding homologies H’, will define 
ak-cap zon U,. Moreover z and wu’ and hence z and wu will be c-homologous 
on kK. 

The proof of (i) is complete. 

We come to the proof of (ii). 

We set SU" = V and let e be an arbitrary positive constant less than one- 
half the minimum distance between pairs of neighborhoods U‘. By hypothesis 
vis c-homologous to 0 on V + Fy. From the algebraic homologies which define 
this c-homology we drod all (k + 1)-cells which are on the complement of V 
and all algebraic homologies whose norms are greater than e/2. The resulting 
algebraic homologies will hold on V.2 and we infer that u is c-homologous to 
0onV.. It then follows from the choice of e that the cycle wu‘ is c-homologous 
to 0 on the corresponding neighborhood U3} . 

The proof of (ii) is complete. 

We come to a deformation lemma. 

LemMa 10.2. Let x be a compact subset of F < c on a separate neighborhood 
of a critical set at the level c. Corresponding to an arbitrary positive constant e 
there exists a positive constant b < c and an F-deformation T of x onto o. + Fr 
in which points of x which have been displaced a distance at least e lie on F,. 

Sete = 3p. Let « and xz be respectively the subsets of x not on o, and oz. 
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There are no semi-critical points on x, at the levelc. It follows from Lemma 92 
that there exists an F-deformation A of x; onto F <c¢ — » where 7 >0. The 
methods used in the proof of Lemma 9.1 make it clear that the deformation 4 
properly modified will yield an F-deformation T of « in which « — x; is held 
fast while x is deformed as in A. Let 6(e) be the displacement function for 
under 7’. 

Points of x are on xe Or o2,. Points of x on x2 are F-deformed under T onto 
F <c-— 4. Points of x on oa, are F-deformed on o3, = o, or else displaced a 
distance at least p, and hence deformed onto F S c — 6(p). Under T points 
of « which have been displaced a distance at least e will lie on F < c — 6(e), 
If b is less than ¢ and greater than c — 7, c — 6(p) and c¢ — 4(e) the lemma 
holds as stated. ; 

We shall now prove Theorem 10.1 beginning with a proof of (a). 

Let @ be a carrier of u on U and F S$ c. We assume (a) false. It follows 
that wu is c-homologous to 0 with homology carrier x D @ on F S c. Corre- 
sponding to the complete critical set w, and to the compact set « Lemma 10.2 
affirms the existence of a constant b < c and an F-deformation 7’ of « onto 
we + F,. Under T, @ is deformed on 0, + Fy. Hence wu is c-homologous to 
Q0Oonw +6.+ Ff. 

We distinguish between the cases w = o andw # o. If w ¥ o we apply 
Lemma 10.1 (ii) with n = 2, setting 
o=0c, ¢ =w-—s, ‘oyu wW=0, U=Hnt+h, U=0, 
taking e so small that U' C U, and U * is at a positive distance from U 1 We 
see that wu is c-homologous to 0 on F, + ZU’ and infer that u is c-homologous 
to 0on U. If w = o we again apply Lemma 10.1 (ii) with n = 1. In both 
cases we conclude that wu is c-homologous to 0 on U. 

From this contradiction we infer the truth of (a). 

We turn to a proof of (b). 

Let e be so small a positive constant that a2, lies on U. The k-cap u lies 
on U and it follows from Lemma 10.2 that u is c-homologous to a k-cap w 
ono. + F,. Since o, is a separate neighborhood of o it follows from Lemma 
10.1 (i) that w is c-homologous to a k-cap v on oz, (setting 7 = e). Thusu—? 
is c-homologous to 0. But from our choice of e, u — vison U. It follows from 
(a) that wu — v is c-homologous to 0 on U. But » is on oe, and e may be arbi- 
trarily small. 

Statement (b) follows directly. 

A k-cap u with cap limit c will be said to be associated with a critical set 
if there is a k-cap in the a-class of u with a carrier on an arbitrary neighborhood 
of son F Sc. With this understood we state the following corollary. 

Corotiary 10.1 If o is a critical set at the level c with a separate neighborhood 
U, a maximal group g of k-caps rel U with cap limits c is a maximal group of 
k-caps associated with o. 

It follows from (a) in Theorem 10.1 that each k-cap u of g is a k-cap rel M 
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and from (b) that w is “associated” with o. Thus g is a group of k-caps asso- 
ciated with o. That g is maximal among k-caps v associated with o is seen as 
follows. The k-cap v is in the a-class of a k-cap u rel U by virtue of its asso- 
ciation with o. But g is maximal among k-caps rel U with cap limit c, so 
that u and hence v is in the a-class of a k-cap of g. The corollary thus holds 
as stated. 

It follows from Theorem 5.1 that any two maximal groups g of k-caps with 
cap limit ¢ are a-isomorphic. The following theorem adds an important mode 
of determination of a particular maximal group of k-caps. 

TuEoreM 10.2. Let the complete critical set w at the level c be the sum of critical 
sets o' (i = 1, «++ , n) with neighborhoods V" at positive distances from one another 
and let g' be a maximal group of k-caps rel V* with cap limit c. The groups g' 
admit a direct sum g and this sum is a maximal group of k-caps with cap limit c. 

Vn begin by proving the following statement. 

(a). If v' is an element of g' the sum v = v' + --- + v" is c-homologous to 0 
only if for each 7, v' is c-homologous to 0 on V*. 

If vis c-homologous to 0 it follows from (a) Theorem 10.1 that v is c-homologous 
to0on ZV‘. Since the neighborhoods V" are at positive distances from one 
another we infer that v’ is c-homologous to 0 on V‘, and the proof of (a) is 
complete. 

It remains to prove statement (). 

(6). A k-cap v with cap limit c is c-homologous to a sum v +--+: +0" of 
elements v' in the respective groups g’. 

It follows from Theorem 10.1 (b) that v is c-homologous to a k-cap on an 
arbitrarily small neighborhood of w and hence to a sum of cycles u' mod, F <c 
on F < c and on the respective neighborhoods V‘. But each such cycle u’ is 
c-homologous on V* to an element v* of g* in accordance with the definition 
of g', and (8) follows immediately. 

Theorem 10.2 follows from (a) and (8). 

The dimension of a maximal group of k-caps associated with a critical set o 
will be termed the kt type number of o. It follows from Corollary 10.1 that 
the k** type number of a separate critical set ¢ is the dimension of a maximal 
group of k-caps with cap limit c relative to a separate neighborhood of o. We 
thus have the following corollary of Theorem 10.2. 

CorotLary 10.2. The kt type number of the complete critical set w is the sum 
of the k*” type numbers of the critical sets o', «++ , 0”. 

A k-cap u with cap limit c which is associated with a critical set o will be 
said to belong to o if u is associated with no proper critical subset of ¢. The 
basic theorem here is as follows. 

THEOREM 10.3. A k-cap u with a cap limit c < 1 belongs to at least one critical 
set at the level c. 

We shall not use this theorem and accordingly omit its proof. 

N on-degenerate differential critical points. We shall consider the important 
case in which M is compact and in which there is a neighborhood of each point 
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p of M homeomorphic with a region U of a euclidean n-space of rectangular 
coordinates (x). Neighboring any point q of U we admit any system of ¢o- 
ordinates obtainable from the coérdinates (x) by a non-singular transformation 
z; = 2(x) with z,(x) of class C’, and we admit only such coérdinate systems 
neighboring g. A space M of this character will be said to be locally of class (°. 
We suppose that our function F equals a function g(x) of class C* corresponding 
to each admissible set of codrdinates (2). 

A differential critical point (2) of g(x) will be termed a differential critical 
point of F. The point (xo) will be termed degenerate if the Hessian of ¢ at 
(xo) is zero. We assume here that the differential critical points of F are non- 
degenerate. As in the introduction the index of a differential critical point (2) 
shall be the index of the quadratic form whose coefficients make up the Hessian 
of g at (a). From the hypothesis of non-degeneracy it follows that the differ- 
ential critical points are isolated, and hence finite in number. 

As in §9 the trajectories orthogonal to the manifolds g constant in the local 
coordinate spaces (x) will serve to prove that homotopic critical points are 
differential critical points. A proof of the following theorem is more difficult. 

THEOREM 10.4. A non-degenerate differential critical point o is a homotopic 
critical set and the j** type number of o equals the Kronecker delta 6}, where k is 
the index of o. 

We suppose that o is represented by the point (x) = (0) in the codrdinate 
system (xr) and that in this system F = g(x). We also suppose that ¢(0) = 0, 
waiving the condition that 0 < F < 1. Setting 


flv) = -ai-— ++) at tata te tah, 


we introduce a preliminary transformation as follows: 

(a). If r is a sufficiently small positive constant the space A defined by the condi- 
tions ¢ S 0 and 2;2; < r’, and the space B defined by the conditions f < 0 and 
iti < r admit a homeomorphism in which the subspaces f < 0 and ¢ < 0 cor- 
respond. 

It follows from methods developed in detail on pp. 169-170 of M1 that the 
space yg S 0, 2:2; = r° and the space f < 0, x;2; = r° admit a homeomorphism 
in which the subspaces f < 0 and ¢ < 0 correspond. A use of the radial tra- 
jectories of pp. 156-157 of M1 then gives an easy proof of (a). 

We shall make use of Corollary 10.1, determining the j** type number of ¢, 
if such a number exists, by determining the dimension » of a maximal group 
of j-caps with cap limit 0 relative to a separate neighborhood of ¢ on ¢ & 0. 
The space A is such a neighborhood if r is sufficiently small, since each homotopic 
critical point is a differential critical point and the only differential critical point 
on A at the level 0 is o if r is sufficiently small. It follows from (a) that » 
equals the dimension of a maximal group of j-caps with cap limit 0 rel B with 
f replacing g. We continue, replacing A by B and ¢ by f. 

If k = 0, B consists of the point c, and the theorem is immediate. We 
therefore suppose that k > 0. When k > 0 we shall prove the following: 
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(b). The homology groups of the subspace f < 0 of B are isomorphic with those 
of the (k — 1)-sphere. 

(c). The only non-null connectivity of Bis Ry = 1. 

(d). Each j-cap u rel B with a(u) = 0, is non-linkable. 

(e). The only j-caps u rel B with a(u) = 0 are k-dimensional, and a maximal 
group of such k-caps has the dimension 1. 

The space B and its subspace f < 0 can be respectively deformed on them- 
selves onto the dise 


2 
(10.1) tte tabs 5, Ty = +++ = 2, = 0 


and its spherical boundary S, holding the disc and S fast respectively. The 
truth of (b) and (c) follows. 

Proof of (d). If there were a linkable j-cap e rel B with a(u) = 0 there 
would be a j-cycle v in the a-class of u. But any j-cycle on B is homologous 
on B toa cycle on S, and f = —r'/2 on S. Hence v cannot be a j-cap with cap 
limit 0 and wu cannot be linkable. 

Proof of (e). If u is a j-cap rel B with a(u) = 0, u is non-linkable. By 
definition then, Bu is on f < O and is there non-bounding. It follows from 
(b) that 7 = lork. If k = 1, the first statement in (e) is true. If k > 1, 
Isay that 7 = k. For if j were 1 it would follow from the connectedness of the 
subspace f < 0 of B when k > 1 that Bu ~ 0 on, this subspace, contrary to the 
fact that w is non-linkable. Hence j = k in all cases and the proof of the first 
statement in (e) is complete. 

To prove the second statement in (e) recall that the subspace f < 0 of B 
contains a (k — 1)-cycle u which is non-bounding on, this subspace but bound- 
ing on B. Hence there is at least one k-cap u rel B with a(u) = 0. 

A maximal set of k-caps rel B with a(u) = 0 will contain at most one k-cap. 
To prove this let u and v be two such k-caps. It follows from (b) that there 
exists a proper homology of the form 


(10.2) 5, Bu + 62.80 ~ 0 (on, f < 0). 


The relative k-cycle 6,u + 6:0 cannot be a k-cap at the level 0 since it would 
then be linkable in accordance with (10.2). Hence 6,u + é9v is not a k-cap 
at the level 0, and the proof of (e) is complete. 

We can conclude that o is a homotopic critical point and set at the level 0. 
For no other point on A at the level 0 is a homotopic critical point as noted 
previously. But 0 is a cap limit rel A as the preceding paragraphs show. It 
follows from Theorem 9.1 that ¢ is a homotopic critical point. Since o is an 
‘solated critical point it must also be a critical set, and the proof of the theorem 
is complete. 

A compact space M locally of class C” has finite connectivities (cf. Lefschetz, 
Deformations, l.c.), and if M is locally n-dimensional the connectivities of 
order greater than n are null. If F has at most non-degenerate critical points 
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the number of critical points is finite and it follows from Theorems 10.4, 10.2 
and 5.3 that a maximal group of k-caps wil! have a dimension m, equal to the 
number of critical points of index k. In particular m; = Oif j > n. Referring 
to the relations (8.12) we have the following theorem. 

TueoremM 10.5. Let F be a function of class C” on an n-dimensional compact 
space M locally of class C’. If the differential critical points of F are non-degen- 
erate the numbers px of critical points with index k and the connectivities R; of M 
satisfy the relations (0.2) of the introduction. 

Essential critical points. A homotopic critical set o to which at least one 
k-cap belongs will be termed homotopically essential and in such a case a semi- 
critical point of will be termed homotopically essential. The class of homotopic 
semi-critical points B thus has a subclass C of points which are termed essential. 
Thus B > C. That the class C may be a proper subclass of B is shown by the 
following example. 

Example 10.2. Let M be a curve y = f(x) in the (2, y)-plane defined as 
follows: 


fle) = sin 2 (—r S20), 
f(z) = 0 (0< 27), 
f(x) = sinz (rf Sx S Qn). 


On M we set F = (y + 1)/2 so that 0 < F <1. The points of the curve at 
which 0 < z S =x are homotopically critical and form a critical set but not an 


essential critical set. 
We have already seen that the class A of differential critical points in problems 
where F is differentiable is such that A > B. We thus have the relation 


(10.3) | ADBDC, 
but B may be a proper subset of A and C a proper subset of B. 


IV. AspsTrAcT VARIATIONAL THEORY 


11. Length. Let N be a space of points p, g, 7, --: with a metric which in 
general is not symmetric. That is to each ordered pair p, q of points of N 
there shall correspond a number denoted by pq and called the distance from 
p tog. These distances shall satisfy the conditions: 


I. pp = 0, 
Il. pg > Oif p ¥ gq, 
Ill. pg S pr + 7@. 


The relation III is not to be written pg < pr + gr inasmuch as this would 
imply that pg = gp upon setting r = p. The spaces N satisfying I-III include 
those with a symmetric distance. For various applications, for instance to the 
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calculus of variations, it is not desirable to restrict ourselves to the symmetric 
ease, The non-symmetric case is illustrated by the integral 


(11.1) = | (cu+-vErHa (0<2z< 1). 


The value of J along a curve in general is changed when the sense of integration 
along the curve is reversed. Curves which are extremals in one sense need 
not be extremals in the other sense. The absolute minimum of J from one 
point to another is a distance which satisfies I-III but which is not in general 
symmetric. 

Let | pq | denote the maximum of pg and gp. We shall call | pq | the absolute 
distance between p and g. With | pq | as a distance N becomes a metric space 
with a symmetric distance function. We shall denote this new space by | N |. 
We shall use the metric of N when defining length. For all other purposes, in 
particular in defining point functions on N or | N |, or general topological 
properties, we shall use the metric of |_N |. One reason for this is that pq does 
necessarily approach zero when gp approaches zero, unless indeed we add 
further conditions to the conditions I-III. 

We understand that a function f(pi,---,p,) of points (p:,---, pn) on 
|N | is continuous at (q:, --+ , dn) if f(pi, --+ , pn) tends to f(a, ---, Gn) asa 
limit as max | p;q;| tends to zero. This is the familiar concept of continuity 
on|N |. A function is called continuous on a point set A if continuous at each 
point of A. We note the following: 

(a). The function pq is a uniformly continuous function of p.and q on | N |. 

Postulate III implies that for points p, q, p’, q’, 


(11.2) pa spp +p t+ 749. 


If we assume that | pp’| < 6 and | qq’| < 6 then (11.2) implies that pg < 
p'? + 26. Upon interchanging p and p’, q and q’ we find that pq and p’q’ 
differ by less than 25 so that (a) is true. 

For each value of ¢ on a closed interval (0, a) let g(t) be a point on | N |. 
The function q(t) is termed continuous at t if the absolute distance | q(t’) q(t) | 
tends to zero as ¢’ tends to ¢. If q(é) is continuous at each point ¢ on (0, a) 
the set of points g(t) taken in the order of the corresponding values of ¢ will 
be termed a parameterized curve \ (written p-curve \). We say that d is the 
continuous image on |_N | of the line segment (0, a). In general curves on 
| NV | will be denoted by Greek letters a, 8, 7, A, etc., while points on | N | will 
be denoted by letters P, 4, 7, ete. 

The length of \. Let t; be a set of values on (0, a) such that 0 = & <ti < 
“* <t, = a, dnd let po, pi, +++ , Pn be the corresponding points on \. The 
sets t; and p; will be termed partitions of \ of norm 6 where 6 is the maximum 
of the differences tisnn — t;. We term 


n—1 
& = > Pi Pi+1, 
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a sum approximating the length of \ and take the length L of as the least 
upper bound of the sums for all partitions of A. The proof of the following 
theorem can be readily supplied by the reader familiar with the ordinary theory 
of length in euclidean spaces. Explicit use of course must be made of postu- 
late III. 

THEoREM 11.1. The length of d is the limit of any sequence =, of approzi- 
mating sums = corresponding respectively to partitions of X whose norms tend 
to zero as n becomes infinite. 

Fréchet distance. Let » and ¢ be two p-curves given by the respective 
equations 


(11.3) p = p(t) (0 
(11.4) q = q(u) (0 


IA 


t 


IIA 


a), 


b). 


IIA 


U 


lA 


Let H be a sense preserving homeomorphism between the closed intervals 
(0, a) and (0, b). The homeomorphism between 7 and ¢ defined by H will be 
termed admissible. Set 


d(H) = max | p(t) q(u) |, 


where ¢ and wu correspond under H. Then the Fréchet distance nf between 
n and ¢ is the greatest lower bound of the numbers d(H) as H ranges over all 
admissible homeomorphisms between 7 and ¢. 

We observe that nf = <n = 0. If moreover 7, ¢, A are three p-curves, 


(11.5) . mr S nf + fr. 


To establish (11.5) let H be an admissible homeomorphism between » and { 
under which a point p of 7 corresponds to a point q of ¢. Let K be an ad- 
missible homeomorphism between ¢ and \ in which q of ¢ corresponds to r of }. 
Let KH be the resultant transformation of 7 into \ carrying p of 7 into r of }. 
From the relation | pr| < | pq| + | gr | we find that 


(11.6) d(KH) s d(H) + d(K). 


Relation (11.5) follows from (11.6). 

By the space of the p-curves will be meant a space in which the elements are 
p-curves and the distance between elements the Fréchet distance. 

We continue with a proof of a well-known theorem. 

TueoreM 11.2. The length of a p-curve n is a lower semi-continuous function 
of n in the space of p-curves ». 

We are to show that if a < L(n) there exists a positive 6 such that L(S) > ¢ 
whenever nf < 6. 

Let b be a constant between a and L(n). By virtue of the definition of L 
there exists a partition po, p1, --- , Pn of » such that 


(11.7) pe PiPisi > b (¢ = 0,1,°°°,"- 1). 
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We choose 6 so that 2n5 <b — a. If nf < 6 there exists a homeomorphism 7’ 
between 7 and ¢ such that corresponding points have an absolute distance 
less than 6. In particular if q,; is the correspondent of p; under 7 the points q; 
determine a partition of ¢ such that | piq; | < 6 and hence 


(11.8) QiQinn > Di Pixs — 26 (¢ = 0,1,---,n — 1). 
With the aid of (11.8) we see that 


L(g) 2 Z Vidi > > Pi Dizi — 2nd. 
It follows from (11.7) and our choice of 6 that 
Li) = Dad > b — (6 — a) =a, 


and the proof is complete. 
If pi, °**, Pn is an arbitrary sequence of points on | N | it follows from 
postulate III that 


PiPn S Pip2 + props + +++ + Pn-ipn. 


This inequality together with the definition of length leads to the following 
theorem. 

THEorEM 11.3. The length of a p-curve joining two points p and q on | N | 
is at least pq. 

u-Length. Let » and ¢ be the p-curves given by (11.3) and (11.4) respectively. 
We shall say that ¢ is derivable from 7 if there exists a continuous non-decreasing 
function w = u(t) which maps the closed interval (0, a) onto the closed interval 
(0, b) and under which p(t) = q[u(t)]. Two p-curves which are derivable 
from the same p-curve 7 will be said to be p-equivalent. The two p-curves 
nand ¢ will be said to be identical, » = ¢, if in the representations (11.3) and 
(11.4) a = b and p(t) = q(t). To show that p-equivalence is transitive it will 
be sufficient to verify the truth of the following statement. 

(a). Corresponding to each arbitrary p-curve there exists a p-curve ¢, from which 
n 1s derivable such that ga = yg whenever the p-curves a and B are p-equivalent. 

If a p-curve of the form (11.3) has finite length, the length s(¢) of » from 
the point ¢ = 0 to a general point ¢ is a continuous non-decreasing function 
oft. if ¢ = ¢(s) is the function inverse to s(t), plt(s)] is a single-valued con- 
tinuous point function of s for s on the interval 0 < s < s(a). We set 
4(s) = p[t(s)]. The curve g = p(t) is derivable from the curve g = q(s) upon 
setting s = s(t). We term q = q(s) the s-curve determined by n. If ¢ is de- 
nivable from 7 it is clear that the s-curve determined by ¢ is identical with the 
‘curve determined by ». For curves with finite length the curve ¢, corre- 
sponding to 7 in statement (a) could be taken as the s-curve determined by 7. 
It follows that p-equivalence is transitive for p-curves of finite length. 

But for curves with infinite length’ there are no s-curves. Nevertheless it is 
possible to replace length s by what will be termed y-length and to associate 
With each p-curve 7 a p-curve ¢, such that statement (a) is satisfied without 
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exception. The parameter of these new curves will be denoted by u and the 
curves themselves will be termed u-curves.” These u-curves will not only 
serve as the p-curves ¢, of (a) but will have other important properties as follows. 

(b) The parameter u on a p-curve ¢, varies from 0 to a value u(n). The function 
u(n) is continuous in n in the space of p-curves n and never exceeds the diameter of 1, 

(c). The condition that » and ¢ be p-equivalent is equivalent to the condition that 
the u-curves y, and gy be identical and to the condition nf = 0. 

(d). Let a be a parameter which ranges from 0 to 1 inclusive. Let the point 
on gy at which » = apu(d) be denoted by g(d, a). Then q(d, a) is continuous in 
its arguments. See ref. 16. Page 441. 

The following is a particular consequence of (d). 

(e). Corresponding to a preassigned (fixed) p-curve n and an arbitrary positive ¢ 
there exists a positive 6 such that if ni < 6, points on ¢, and yg; which divide o, 
and gy in the same ratio with respect to variation of u have an absolute distance 
at most e. 


12. The space 2. A class of p-equivalent p-curves will be termed a curve. 
The p-curves of a given class \ have a 0-distance from each other. The dis- 
tance \r between two curves \, 7 shall be defined as the distance nf between 
any two p-curves 7 and ¢ in the respective classes \ and r. It is clear that the 
distance \7r is independent of the curves 7 and ¢ chosen to represent \ and 7 
respectively. It follows from property (c) of u-curves in §11 that Ar = 0 if 
and only if the classes \ and 7 are identical. Curves X, 7, w, --- accordingly 
form a symmetric metric space M of the same character as the space M of §1. 

Let a and b be two points on | N |. The set of curves joining a to 6 will 
form a subspace Q(a, b) of the space M of all curves on | N |. We shall investi- 
gate the properties of Q(a, b). 

Recall that a subset A of a metric space is termed complete if every Cauchy 
sequence in A converges to a point in A, and conditionally compact if every 
infinite sequence of A contains a Cauchy subsequence. (See Hausdorff, |.c.) 
If A is conditionally compact and complete A is compact. 

Apart from trivial cases where a = b, or the set Q(a, b) is empty, 2(a, b) is 
not conditionally compact. In fact if Q(a, b) contains a curve 2 it is possible 
to define an infinite sequence of curves A, each of which joins a to b and is 
defined by a point p(t) which varies continuously on \ but is such that the 
sequence ), contains no Cauchy subsequence. 

We have denoted the length of a p-curve by L. We note that the value of L 
will be the same for each p-curve of a class \ of p-curves which are p-equivalent. 





18 Morse, M., A special parameterization of curves, Bulletin of the American Mathemati- 
cal Society, 42 (1936). This paper contains a proof of statements (a), (b), (c), (d) anda 
definition of y-curves. This definition is a modification of a definition of a function of 
sets given by H. Whitney and applied by Whitney to families of simple non-intersecting 
curves. See Whitney, H., Regular families of curves, Annals of Mathematics, 34 (1933), 
pp. 244-270. 
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This common value of L will be termed the length of \ and again denoted by L. 
We understand that L may be infinite. 

We shall prove the following theorem. See Fréchet ref. 17, page 441. 

TuroreM 12.1. If |.N | is compact and c is a finite constant, the subspace 
L < cof Q(a, b) ts compact. 

We shall begin with a proof of statement (i). 

(i). If | N | is complete, Q(a, b) ts complete. 

Let \1 , Ae , be a Cauchy sequence A of curves of Q(a, b). Given e > 0, there 
exists an integer N so large that \;A; < e wheneveri > N andj > N. We 
choose a subsequence of A as follows. Let m be.chosen from A so that the 
distance of m from every curve \, following m in A is less than 3. Proceeding 
inductively, for i > 1 choose ; from curves A; following n;-; in A so that if \,, 
follows 7; in A, 


Nilm < a. 
The sequence m, 72, *** iS a subsequence of A with the property that 
(12.1) nina < 27. 
Let 71, 72, °°* be a sequence of homeomorphisms of which 7; is a homeo- 


morphism between »; and 7:41 in which corresponding points p; and pi, have 
an absolute distance less than 2~*. Such homeomorphisms exist by virtue of 
(12.1). For 7 > «7 the homeomorphisms 7; 744; --+ 7;-1 taken in the order 
written yield a homeomorphism 7;; between ; and 7; by virtue of which corre- 
sponding points p; and p; have an absolute distance 


1 1 


we 
(12.2) lpepil < a+ ge t+ + oR < ge 


Suppose 7 is represented by the p-curve p = pi(t),0 S$ ¢ <1. By virtue 
of the homeomorphism 7; a p-curve p = 7p,(t) representing; is uniquely 
determined, in such manner that p,(¢) and p,(¢) correspond under 7;;. Relation 
(12.2) now takes the form 


(12.3) | ps(t) p,(t) | < —_ 
It follows from the completeness of | N | that the sequence of points p,(t) con- 
verges uniformly with respect to ¢ to a continuous point limit p(t). The se- 
quence ; thus converges to the curve represented by p = p(t), and the proof 
of (i) is complete. 

We continue with a proof of statement (ii). 

(ii). If | N | is conditionally compact, the subspace L  c of Q(a, b) is condi- 
tionally compact. 


We shall make use of the n-fold product | N |" of | N |. The distance between 
two points 


(p) = (1, «++ , Dn); (q) = (m1, -°* 5 Qn) 
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of | N |" shall be defined as the maximum of the distances | p;q;| and shall be 
denoted by (p)(q). 

Let A° be an infinite sequence of curves \},\2, °°: on L Se. Let e pay oes 
be a sequence of positive constants tending to zero. Let m > 0 be an integer 
so large that the division of the curve \! into parts of equal variation of length 
by the ordered set of points 


yields segments of \? with absolute diameters less that e,/3. An integer n, 
exists since we are concerned with curves of length at most c. 

Since | N | is conditionally compact, | N |"' is conditionally compact. The 
points (p') in the space | N |"! will thus have a Cauchy subsequence (r’), 


j = 1, 2,---, and we can assume that the sequence (r’) has been so chosen 
that 
(12.5) (or) <3 (i,k = 1,2, --»), 


Let \} be the curve of A° of which (r’) is a partition. It follows from (12.5) 
and from the fact that the set of points (r’) divide dj into segments of absolute 
diameter less than e;/3 that 

(12.6) Nie < 1 (j,k = 1,2, +++). 


The sequence d; has been selected from the sequence \;. We repeat this 
process replacing e; by e: and \} by Aj, thereby obtaining a subsequence Ni 


of \;. More generally we obtain a subsequence Aj, Az, --+ Of the sequence 
AT’, A, «++ of such a nature that 
ae < «, (i, k = 1,2,-+-). 


The sequence 
(12.7) iy 
is a subsequence of A° and has the property that 
(12.8) AnAm < er (if n, m >"). 


It is accordingly a Cauchy subsequence. 

The proof of (ii) is complete. 

We can now prove the theorem. It follows from (i) that the space 2(a, b) 
is complete. The subspace L < c of Q(a, b) is closed since L is lower semi- 
continuous. Hence this subspace is complete. The theorem follows from (ii). 

Since F is lower semi-continuous Theorem 12.1 has the following corollary. 

Corotiary. If Q(a, b) is not empty there exists at least one curve d of 2(4, b) 
whose length (possibly infinite) is a minimum™ among lengths of curves of 2(4, b). 





14 See Tonelli, Fondamenti di calcolo delle Variazioni I and II. References will here be 
found to the work of Hilbert on the absolute minimum. 
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If H is a component (in the point set sense) of Q(a, b) there is at least one curve 
¢ of H whose length (possibly infinite) is a minimum among lengths of curves of H. 

A class of curves of Q(a, b) continuously deformable on | N | among curves 
of O(a, b) into a given curve of Q(a, b) will be termed a homotopy class. Among 
curves joining two fixed points a and 6 on a torus for example there are infinitely 
many homotopy classes, and on a torus these homotopy classes are identical 
with the components of Q(a, b). Under the condition of local convexity to be 
developed in the next chapter we shall see that the homotopy classes are iden- 
tical with the components of 2. Our corollary will then affirm the existence 
of at least one curve in each homotopy class K of Q with length a minimum 
among lengths of curves of K. 

Let H*(a, b) be the kt* homology group of Q(a, b). Let x © y be read “z is 
simply isomorphic with y.” We see that H‘(a, b) © H*(b, a). We shall 
extend this result by proving the following theorem. 

THEOREM 12.2. If | N | ts arcwise connected, 


(12.9) H*(a, b) © H'(c, d), 


where (a, b) and (c, d) are arbitrary pairs of points on | N |. 

We shall begin by proving statement (i). 

(i). If there is an arc on | N | joining b to c then H*(a, b) © H*(a, c). 

Let » be a curve joining b toc on |N |. Let d be a curve joining a to b. 
By the extension of X by » we mean the curve obtained by tracing A and 7 
successively. We denote this extension by (A), and observe that (A), joins 
a toc. 

Let a; be an oriented k-cell on Q(a, b). By the extension (a,), of a, by 
we mean the oriented k-cell on Q(a, c) obtained by replacing each vertex of a, 
by its extension by 7, understanding of course that a vertex of a; is a curve of 
O(a, b). Let 2, = 6‘a; be an algebraic k-chain on Q(a, b). By the extension 
(zx), of z, is meant the k-chain 


(12.10) (zz)y = 8°(ai), [on 2(4, ¢)}. 
We observe that 
(12.11) B(Zx)q = (Bzx)n, 


that is boundary and extension operators are commutative. If u; is the j™ 
component of a Vietoris cycle u, the algebraic j-cycles (z;), form a new Vietoris 
k-eycle (u), on Q(a, c). This follows in part from (12.11) and in part from the 
fact that the distance between two curves of Q(a, b) equals the distance between 
their extensions on Q(a, c), so that norms are not altered by the process of 
extension. 

Let n’ be the curve obtained by tracing 7 in the opposite sense. If z, is an 
algebraic k-cycle of norm e on Q(a, b), set 


(12.12) ze = [(Zu)ale’ - 


We see that z¢ is an algebraic k-cycle of norm e on Qa, b). 
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We shall define a deformation D which continuously deforms z, into zi on Q(a, b). 

To that end let » be given in the form p = p(r) with 0 S + S 1 and let », 
denote the subare of 7 on which 0 S tr S ¢ S$ 1. Under D suppose that an 
initial point (i.e. curve) of Q(a, b) is replaced at the time ¢ by the point (ic. 
curve) 


[(A) neler (sts 1) 


of Q(a, b) where 7; represents 7 taken in the opposite sense. The deformation 
D carries z; into Ze as stated. Hence 


(12.13) Ze “ets 2k [on Qa, b)], 


where e is the norm of z, and z and 6 is an arbitrary positive constant. 

We return to the proof of (i) and set up an isomorphism between the groups 
H*(a, b) and H*(a, c). 

To a k-cycle z of Q(a, b) corresponds the cycle (z), of Q(a, c). It follows 
from (12.11) that k-cycles which are homologous on Q(a, b) thereby correspond 
to k-cycles which are homologous on Q(a, c). Each homology class of H*(a, b) 
is thus mapped onto a homology class of H*(a, c). We designate this mapping 
by 7. That this mapping is a homeomorphism follows at once from the defini- 
tion of the operation of extension by 7. It remains to prove that T is an 
isomorphism. 

The mapping T leads to each class of H“(a, c). If u is a k-cycle on Qa, ¢), 
[(u)»], is a k-eycle on Q(a, c), homologous to wu as we see from (12.13) upon 
interchanging (a, b) and (a, c). But (u), is on Q(a, b). Thus T leads to 
each class of H*(a, c). 

To show that the mapping 7' is one-to-one we have merely to show that 
the null class in H*(a, c) corresponds to the null class in H*(a, b). Suppose a 
k-cycle on H*(a, c) is the extension (z), of a k-cycle z on H“(a, b). If (2), ~ 0, 
we have 


O~ (n)y ~ 2 [on Q(a, b)}. 


Hence T is an isomorphism as stated, and the proof of (i) is complete. 

To prove the theorem we note that the operations replacing b by c in H ‘a, b), 
or of interchanging a and b lead to groups isomorphic with H*(a, b). We can 
thus successively replace (a, b) by (a, c), (c, a), (c, d) and obtain a group 
H*(c, d) isomorphic with H*(a, b). 

The proof of the theorem is complete. 

We state the following corollary of the theorem. 

Corouiary. The connectivities of Q(a, b) are topological invariants of | N |. 

Whether or not the connectivities of Q(a, b) are determined by arithmetic 
topological invariants of | N | now known is an interesting unsolved problem. 
It seems probable that they are not so determined. 

If | N | is a space homeomorphic with an n-sphere with n > 1, the connec- 
tivities R, of Q(a, b) are 1 if k = 0 mod n — 1, otherwise null. For example, 
for n = 2 and 3 respectively, the numbers R; form the sequences 1 1 1 --: and 
1010---. See M1, Theorem 15.1, p. 247. 
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The function F. Let 7 be an arbitrary curve of Q(a, b) and let L(n) be the 
length of 7. We set 


(12.14) 7) = 1+ L(@) 


understanding thereby that F = 1 when L is infinite. The function F is lower 
semi-continuous withO S$ F $1. Fore < 1 the subdomains F < c of Q(a, b) 
are compact in accordance with Theorem 12.1. A curve A of Q(a, b) will be 
called a homotopic pseudo-extremal or extremal respectively if \ is a homotopic 
semi-critical point or a critical point of F respectively. 

Our theory of functions F defined on spaces M is now applicable. 

We turn to the development of conditions under which a pseudo-extremal is 
always an extremal, that is conditions under which a critical pair (p, c) is of 
the form [p, F(p)]. 


V. Locatity Convex SPACES 


13. Elementary arcs. In this chapter we shall suppose that the space of §11 
is conditioned by two new postulates IV and V. In stating V we shall need 
the notion of an elementary arc. If p and gq are two distinct points of | N |, 
an are \ joining p to q will be termed elementary if it has the following two 
properties. It is simple, that is it is defined by a one-to-one continuous image 
p(t) of a line segment 0 S ¢ S a; a point r shall lie on ) if and only if 


(13.1) py = pr + rq. 


Postulates IV and V are as follows. 

IV. The space | N | shall be compact and connected. 

V. There shall exist a positive constant p such that whenever 0 < pq S p there 
exists at least one elementary arc which joins p to q. 

The: elementary are postulated in V will be denoted by (pq). We shall 
term V the postulate of local convexity and p the norm of convexity. We shall 
restrict ourselves to elementary arcs (pq) for which pq S p. 

The postulates I-V are independent as can be readily shown. They are 
satisfied on any ordinary closed manifold without singularity and with a suffi- 
ciently regular local representation. The elementary arcs are then geodesics, 
and geodesic arcs issuing from a point p and cut off at a sufficiently small 
distance p from p cover a neighborhood of p in a one-to-one continuous manner, 
palone excepted. On| N | however the elementary arcs issuing from a point p 
cover a neighborhood of p in general multiply as examples would show. 

We regard two elementary arcs as identical if they are similarly sensed and 
consist of the same points. With this understood we have the following 
theorem. 

THEOREM 13.1. There is at most one elementary arc joining a point p to a 
point q. 

Let \ and ¢ be elementary arcs joining p tog. Let r be a point on \ between 
pand q. Then pg = pr + rq. By virtue of the definition of an elementary 
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arc this implies that r lies on ¢ as well as A, and the proof of Theorem 13.1 js 
complete. 

The following theorem is of importance. 

THEOREM 13.2. Any subarc of an elementary arc (pq) is an elementary are. 

We shall begin by proving statements (1) and (2) below. In these statements 
and elsewhere we shall make the special convention that the elementary are 
joining a point p: to p= is the point p itself. 

(1). If r is a point on (pq), then the arc pr on (pq) is elementary. 

(2). If ris a point on (pq), then the arc rq on (pq) is elementary. 

To establish (1) we first note that pr S p since pr + r¢g = pq S p. It follows 
from V that there is an elementary arc (pr) joining p tor. Let s be any point 
on (pr). Then pr = ps + sr. Upon adding rq to both sides of this equality 
we find that 


(13.2) pr+rq = ps + sr + rq. 


Since r is on (pq) we can replace the left member of (13.2) by pg. In the right 
member it follows from postulate III that sr + rq 2 sq so that (13.2) takes 
the form 


(13.3) pq = ps + sq. 


Upon comparing (13.3) with postulate III we infer that pg = ps + sq so that 
s must lie on (pq). 

We have yet to show that s is on the are pr of (pq). To this end let (pq) 
and (pr) be respectively the one-to-one continuous images of the segments 
0<tS1land0 S221. Given any value of zon 0 S x S 1 there is deter- 
mined a unique point P on (pr) and, since P lies on (pq) as we have just seen, 
a unique value of ¢t. We denote this value of t by é(x). It is readily seen that 
the relation t = ¢(x) is one-to-one and continuous for z on 0 S$ x <1. Hence 
(pr) must coincide with the are pr of (pq), and (1) is proved. 

The proof of (2) is similar to that of (1) interchanging the rdéles of p and q 
in the proof of (1) and appropriately changing the order of the points involved. 
To establish the theorem let p, r, s, g be points on (pq) in the order written. 
By virtue of (1) the are ps of (pq) is elementary. It follows from (2) that the 
arc rs on (ps) is elementary, and the proof is complete. 

Theorem 13.2 leads to the following. 

THEOREM 13.3. If r = r(r), 0 S 7 S 1, defines an elementary arc joining 
p to q, the distance pr(r) is a continuous increasing function of r. 

The continuity of pr(r) is a consequence of the continuity of r(7) and the 
continuity of pr as a function of p andr. To show that pr(r) is an increasing 
function of 7 we note that 


(13.4) pr(r’) = pr(r) + r(r)r(r’) O<1r<7 


so that pr(r’) > pr(z). 
We are able to prove a theorem which is much stronger than Theorem 13.3. 
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To state this theorem let (pq) be an elementary arc with variable end points p, q. 
Let ¢ be a number between 0 and pq inclusive. Let the point r on (pq) at the 
distance t from p be denoted by f(p, q, t). Our theorem is the following. 

TurorEM 13.4. The point function f(p, q, t) is continuous for pq S p and 
0st Ss pg. 

Let (pn, Qn, tn) be a sequence of admissible sets (p, g, t) which tend to a 
set (p’, q’, t’) as a limit set. Set rn = f(pn, qn, tn). Since | N| is compact 
there is a subsequence of the points r, with a limit point r°. For simplicity 
we assume that the sequence r, itself converges to r°. We shall prove that 


(13.5) r = f(p’,q’, t). 


Upon letting n become infinite in the relation pPrgn = Patan + Trn We infer 
that 
(13.6) pq = pr + rq. 

We also have the relations 

(13.7) pr = lim par, = limt, = ¢’. 

From (13.6) and (13.7) we see that r° is the point on the elementary are p’r° 
which is at the distance ¢’ from p’. Hence (13.5) holds, and the theorem is true. 

The following theorem depends upon the compactness of | N | and postu- 
lates I-III. 

THEOREM 13.5. The absolute distance | pq| approaches zero uniformly with 
pq, and the absolute diameter of any curve approaches zero uniformly with its 
length. 

Suppose the first affirmation of the theorem false. There will then exist a 
sequence of pairs of points (p,, qn) whose absolute distances are bounded 
from zero but whose distances png, tend to zero as n becomes infinite. There 
will exist at least one limit pair p*, g* of the pairs p,, q,. For such a pair 
p*q* = 0 so that p* = q*. But | p*q*| ¥ 0 since the distances | p,q, | are 
bounded from zero. From this contradiction we infer that | pq | approaches 
zero uniformly with pq. 

The concluding affirmation of the theorem follows from the definition of 
length and the first statement in the theorem. 

Recall that a metric space M is termed separable if there is a subset of points 
of M whose closure is M and which is at most countably infinite. We state 
the following theorem. 

TaEorEeM 13.6. Under postulates I-V the space Q(a, b) is separable. 

The space | N | is compact and therefore separable. Let (gq) be a set of 
points of | N | at most countably infinite with closure |_N |. We suppose that 
the points a and b belong to (q). By a broken extremal ¢ is meant a curve 
composed of a finite succession of elementary arcs. The end points of these 
elementary arcs will be called the vertices of ¢. Let H be the set of all broken 
extremals which join a to b and whose vertices belong to (q). It is clear that 
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the set H is at most countably infinife. Corresponding to an arbitrary curve \ 
of Q(a, b) there is a broken extremal ¢ within a prescribed distance e of \. We 
suppose ¢ so chosen that the lengths of its elementary arcs are at most p/2. 
Let the successive vertices of ¢ be designated by a, pi, ---,Pn,b. We replace 
the vertices p; by points q; of (gy) so near the respective points p; that the 
successive points a, g1, °°: , Qn, 6 can still be joined by elementary arcs, and 
so near p; that the distance of the resulting broken extremal 7 from ¢ is at 
most e. The curve 7 belongs to the set H, and An S 2e. 
The proof of the theorem is complete. 


14. Metric extremals. We begin this section with the following theorem. 

THEorEM 14.1. The length of an elementary arc (pq) equals pq and is a proper 
minimum relative to the lengths of all curves which join p to q. 

Let pi, -+* , Pn be a set of points which appear in the order written on (pq). 
Any segment of an elementary arc is an elementary arc. It then follows 
inductively for n = 1, 2, --- that 


pq = ppi + Pops + -*> + Dad. 
We infer that the length of (pq) equals pq. 

Let \ be any curve joining p to g. It follows from Theorem 11.3 that 
L(A) 2 pq. It remains to show that L(A) > pq when d is not the elementary 
arc (pq). 

The proof falls into two cases. 

Case I. The curve \ contains a point s not on (pq). 

Case II. Each point of d is on (pq), but X # (pq). 

In Case I we see that L(A) = ps + sq > pq, and the proof is complete. 

In Case II there must be distinct points r and s on \ and on (pq) which 
appear in the order rs on (pq) but in the order sr on X. Then 


L(\) 2 ps + sr +1q > ps + rq > ps + sq = 74, 


and the proof of the theorem is complete. 

A metric extremal d shall be the continuous image on | N | of a closed or open 
interval T of the ¢-axis such that every inner point ¢ of JT is an inner point of a 
closed subinterval of 7 whose image on ) is an elementary arc. A curve which 
is not a metric extremal will be termed metrically ordinary. We continue with 
the following theorem. 

THEOREM 14.2. Let a and b be two points on | N |. Under the hypothesis of 
local convexity a curve of minimum length joining a to b is a metric extremal. 

The successive points of a sufficiently fine partition of \ can be joined by 
elementary arcs thereby forming a curve of finite length. Hence the length 
of \ must be finite. Moreover each arc 7 of \ with a length less than p must 
be an elementary arc. For if p and q are the end points of 7, pq S L(n) <P 
in accordance with Theorem 14.1 so that p and q can be joined by an elementary 
arc (pq). If 7 is not an elementary arc pq < L(n) by virtue of Theorem 14.1 
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and \ can be shortened by replacing 7 by (pq). It follows that d is a metric 
extremal, and the proof is complete. 

Homotopic extremals and pseudo-extremals were defined in §12 before the 
hypothesis of local convexity was introduced, while metric extremals were 
defined after this hypothesis was introduced. The principal theorem of this 
section is the following. It includes Theorem 14.2 as a special case. 

TurorEM 14.3. Under the postulates I-V the class of homotopic extremals on 
Q(a, b) is identical with the class of homotopic pseudo-extremals on Q(a, b) and 
each such extremal is a metric extremal. 

The proof of this theorem will require the introduction of a lemma and a 
basic deformation 6. We proceed with the necessary definitions and proofs. 

Admissible sets r. Let be a curve on | N | with end points a and b. If r 
is a sufficiently large integer, there is a set of r — 1 points on \ dividing \ into 
a sequence of arcs 


(14.1) w= (i, °°: , Ay) 


on each of which the distance from the initial point to the terminal point is at 
most p. A set w of curves \, obtained in this manner from a curve \ joining 
a to b on M will be termed admissible. We term \,; the 7** component of z. 
Corresponding to an admissible set + we denote the curve \ which 7 deter- 
mines by A(z). 

Let 7 be an admissible set (A;). We denote the elementary are which joins 
the initial point of A; to the terminal point of A; by r\*. The set of curves A? 
will then be denoted by z*. The set x* is determined by 7. The curve \(z*) 
is a broken extremal. 

The following lemma is a broad generalization of Osgood’s Theorem.” In it 
we refer to the function F(A) defined in §13. 

Lemma 14.1. For admissible sets x which possess a fixed number r of com- 
ponents and for which the distance of (x) from X(x*) is not less than a positive 
constant e the value of F on d(x) exceeds the value of F on X(x*) by an amount 
which is bounded away from zero. 

In proving this lemma admissible sets 7 will function as points in a metric 
space. The distance between two admissible sets 7, 


w= (M,°°°, Ar); mw = (h,°°:, Ay) 


with the same number of components shall be defined as the maximum distance 
MiA; and denoted by rz’. 





'® See Tonelli, l.c. 

* Cf. Fréchet Sur une représentation paramétrique intrinséque de la courbe continue 
la plus générale Journal de Mathématique t. IV, 1925, pp. 281-297. This very interesting 
tepresentation could not be used in the present paper because it lacked the property (d) 
of our representations. 

"Fréchet Sur quelques points du calcul fonctionnel, Rendiconti del Circolo Matematico 
di Palermo, vol. 22 (1906), pp. 1-74. 
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We consider a fixed positive integer r and observe that L[A(x*)] < rp for all 
admissible sets + with r components. Hence there is a constant m dependent 
on r such that 


F[A(r*)] S m <1. 


Let k be an arbitrary number between m and 1. We divide admissible sets x 
with r components into two classes I and II according as 


I. F[A(x)] > k, II. F[A(r)] S k. 
We then introduce the function 
| H(r) = F[X(m)] — F[(a*)). 
If w is in Class I, 
(14.2) H(xr) >k—m> 0. 


We turn to Class II. There is a constant c dependent on k such that each 
component \; of an admissible set 7 in Class II has a length at most c. The 
set of all curves on | N | with lengths at most c forms a compact space @ (the 
proof of this is similar to the proof of Theorem 12.1). Let G’ denote the product 
space in which G is an r-fold factor. The space G’ is compact. For r fixed, 
the sets which are admissible and belong to Class II, and for which the dis- 
tance \(r)A(x*) = e form a closed subset Go of G’. 

The function H(z) is lower semi-continuous. For F(A) is lower semi-continu- 
ous in \, A(z) varies continuously with admissible sets + and F[A(z*)] is a 
continuous function of admissible sets . By virtue of Theorem 14.1, H(z) is 
positive on G). At some point 7 of Gp , H(m) then takes on a positive absolute 
minimum. That is 


H(r) 2 H(m) > 0 (x C Go). 
Combining this result with (14.2) we find that 
H(m) = min [k — m, H(m)] > 0 


for the sets z of the lemma. 

The proof of the lemma is complete. 

We shall define an F-deformation 0(n, m) of the neighborhood of an arbitrary 
curve 7 of Q(a, b). To that end we shall need a representation of curves ) of 
Q(a, b) in terms of y-length. See §11. Let u(A) be the y-length of ) and let « 
be a parameter which ranges from 0 to 1 inclusive. Let the point q on ) at 
which the u-length of \ equals ad\(u) be designated by 


(14.3) q aut qa, a). 
The function q(A, a) is conditioned by properties (b), (d) and (e) of u-length 
as given in §11. 

The deformation 0(n, m). Let m be a positive integer so large that 7 admits 
a partition into m successive subares with the following properties. The diam- 





mh -_- nn Kr 


es 


on 








all 
ent 


ST 


ach 
The 
the 
uct 
ed, 
lis- 


nu- 
sa 
) is 
ute 


ary 
. of 
t a 
at 


gth 


nits 





FUNCTIONAL TOPOLOGY AND ABSTRACT VARIATIONAL THEORY 443 


eter of each of these arcs shall be less that p/2. If » is not an extremal at 
least one of these subares shall not be an elementary arc. Let V be a neighbor- 
hood of » on Q(a, b) and X a curve on V. The curve X shall be divided into m 
successive subares on which the variation of the parameter a in q(A, a) in (14.3) 
shall equal its variation on the corresponding subares of y. It follows from 
property (e) of u-length in §11 that the neighborhood V can be chosen so small 
that the diameter of each of the m-subarcs of ) is less than p/2. We suppose V 
so chosen. 

Let w be the z** subare of A. We shall define 6 for w, understanding that 
the deformation 6 of \ is obtained by simultaneously applying 6 to each of the 
subares w of X. In the deformation @ the time ¢ shall range from 0 to 1 inclu- 
sive. Lett, = 2.",n = 0,1, 2,---. We shall first define 6 for the times t, . 

For t = t, the deformation @ shall replace w by a broken extremal w, whose 
end points are the end points of w and whose successive vertices divide w into 2” 
successive segments of equal variation of the parameter a of \. Let (a), denote 
the ordered set of parameter values a at the end points and points of divi- 
sion of w. 

Let the curve which replaces w at the time ¢ be denoted by w'. We shall 
now define w’ when ¢ is on the interval 


ti <t< tx. 


Let a’ and a” denote any pair of successive values of @ in the set (@),1. In 
the set (a), there is just one value a* of a between a’ anda”. Let g denote 
the elementary are of w, which joins the point a* on w to the point a” on X. 
Let ¢ divide the interval (é, , tn1) in an arbitrary ratio with respect to length. 
Let p(t) denote the point on g which divides g in the same ratio with respect 
to length. Corresponding to each pair a’, a’’ chosen as above we replace the 
arc £ of w, with end points at the points a’ and a’ on w and intermediate vertex 
at a* on w by the broken extremal with end points of £ and one intermediate 
vertex p(t), thereby obtaining the curve w’. 

The definition of 6(n, m) is complete. 

In showing that @ is an F-deformation we shall use the concept of an adjoin 
of a point set A of |N |. By the adjoin of A is meant the set of points on all 
elementary arcs whose end points lie on A. By the 2-fold adjoin of A is meant 
the adjoin of the adjoin of A. We shall make use of the following lemma. 

Lemma 14.2. Corresponding to an arbitrary positive constant e there exists a 
positive constant 6 such that the diameter of the 2-fold adjoin of A is at most e 
whenever the diameter of A is at most 6. d 

This lemma is a consequence of our theorems on elementary arcs. 

To prove that @ is an F-deformation of the neighborhood V of » we must 
establish the following: 

(i). The curve \. replacing \ at the time t varies continuously with d and t for d 
on any compact subset S of V and for0 < t < 1. 

(ii). The deformation 6 admits a displacement function on S. 
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Let ¢ denote an arbitrary point of S and 7 an arbitrary value of ¢ on the 
interval 0 < ¢ <1. Regarding X, as defined for the moment only for curves ) 
on S we shall prove that d; is continuous in ) and ¢ at the pair (¢, 7). We shall 
distinguish two cases: I. r = 0; II. 7 > 0. 

We first consider Case I. The point g on has been given the representation 
q = q(\, a). The function g(A, a) is continuous in its arguments in accordance 
with (d) of §11. It follows that for \ on S the 2” successive arcs k into which w 
was divided in defining @ will be uniformly small in diameter provided n > N, 
and N is a sufficiently large integer. Let e be an arbitrary positive constant. 
It follows from Lemma 14.2 that if N is sufficiently large the 2-fold adjoin of 
two successive arcs k will have a diameter at most e/2. In defining 8, \, was 
obtained from \ by replacing (certain) pairs k’, k’’ of adjacent ares k of \ by 
two successive elementary arcs of the 2-fold adjoin of k’.+ k”. Fort <2” 
each of the above arcs k involved in the construction of \; belongs to at least 
a 2"-fold partition of an are w. Set 6 = 2”. From the choice of N we infer 
that 


(14.4) MAS 5 (AC S,t <8). 
We next observe that 
(14.5) ME SMAEN 
so that we see from (14.4) and (14.5) that 
uso Se when [Aft < e/2;\’ C S;t < Gj. 


The continuity of \, is established at (¢, 7) when 7 = 0. 

We turn to Case II. For¢-2= 7 and sufficiently near 7, or for ¢ S 7 and suffi- 
ciently near 7, the curve ,; is a broken extremal whose vertices vary continu- 
ously with \ and ¢ for \ on S in accordance with (d) of §11. Hence ), is con- 
tinuous at (A, 7) in Case II. 

The proof of (i) is complete. 

We shall continue with a proof of (ii). 

Let \ be a curve of S. Let ¢’ be an antecedent of ¢”’ on the trajectory T 
of \ under 6. Let e be a positive constant. Setting F(¢’) — F(¢’) = AF we 
must show that whenever {’¢” > e, 


(14.6) AF > &(e) 


where 6(e) is a positive constant which depends only on S and e. 

Let ¢, denote the image of \ at the time ¢,. We shall say that the pair 
¢’, ¢”” is in Class 1 if for no n is ¢, on T between ¢’ and ¢”, and if {’¢" > &. 
We shall first establish (14.6) for pairs ¢’, ¢’” in Class 1. 

As constructed in the definition of 6, ¢” i is a succession of r elementary arcs ti 
The curve ¢’ is a succession of r curves ¢; whose end points are joined by the 
respective elementary arcs ¢;. We can apply Lemma 14.1 to ¢’ and ¢” identi- 
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fying ¢’ with \(m) and ¢” with A(x*) and infer the existence of a function w(r, ¢) 
such that 
(14.7) AF > w(r, e) > 0 (ee > e). 


Relation (14.6) will follow from (14.7) provided the integer r in (14.7) is bounded 
for pairs ¢’, ¢”” in Class 1. The integer r will fail to be bounded only if there 
are pairs ¢’, ¢’” in Class 1 arbitrarily near \, as one sees upon representing the 
curves \ of S in the form g = q(A, a) of (14.3), and recalling that S is compact. 
But pairs ¢’, ¢” arbitrarily near \ would be arbitrarily near each other contrary 
to the hypothesis that ¢’f’’ > e. We conclude that r in (14.7) is at most an 
integer R. We can accordingly take 6(e) in (14.6) as the minimum, say 4;(e), 
of the numbers 


w(1, e), w(2, e), --+ , w(R, e). 


Thus (14.6) holds for pairs ¢’, ¢” in Class 1. 

We turn to pairs ¢’, ¢”” not in Class 1. 

As previously we admit pairs ¢’, ¢’ such that ¢’¢’’ > e and such that ¢’ 
is an antecedent of ¢’’ on a trajectory 7’ emanating from a curve \ of S. If ¢’ 
and ¢”’ are not in Class 1 there exist integers p and s such that the images 


(14.8) Cpu ? Fr Sp ’ fs ’ my Ce 


of \ appear in the order written on JT. We admit the possibility that consecu- 
tive curves in (14.8) may be identical. The function F(f) decreases mono- 
tonically as ¢ moves along T' in the sense of increasing ¢t. Hence the left mem- 
ber of(14.6) is at least as great as each of the three numbers 


F(S’) — FG), FG) — FG), Fs) — FO"). 


Admissible pairs ¢’, ¢’” not in Class 1 will be assigned to Classes 2, 3, or 4 
according as 


2. §%p2 3? 3. Spb. 2 t 4.65" 2 3 
Since (’¢” > e each pair ¢’, ¢” will belong to at least one of these classes. 
Proceeding as under Class 1 we can infer the existence of a function 6;(e), ¢ = 
2,3, 4, such that AF > 6,(e) when ¢’, ¢’” is in Class i. 

For a fixed e let 6(e) denote the minimum of the numbers 6;(e) for i = 1, 2, 3, 4. 
For this 6(e), (14.6) holds as stated, and the proof of (ii) is complete. 

The deformation 6(n, m) is accordingly an F-deformation of the neighbor- 
hood V of 7. 

Proof of Theorem 14.3. Let » be a curve of Q(a, b) and let 6(n, m) be a corre- 
sponding F-deformation of curves \ on a neighborhood of 7. Let m and Ay 
be respectively the final images of 7 and \ under @. We shall begin by proving 
the following statement. 
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(i). Corresponding to an arbitrary positive constant e there exists a positive 
constant 6 such that for \n < 6, 


(14.9) F(M) S F(m) + e. 
The final image \; of » is a succession of m elementary arcs. Let (p) = 
(po, *** » Pm) be the successive end points of these elementary arcs. Designate 


the set (p) determined by m by (q). Let the distance (p)(q) be defined as in 
§12 as the maximum of the distances | piq:|. According to the construction 
of 6 the points p; divide \ in the same ratios with respect to y-length as the 
points q; divide ». For » fixed it follows from property (d) of u-length in §11 
that the distance (p)(q) tends to zero with An. But F(A,) may be regarded 
as a function ¢(p) of the set (p) determining (A,). The function ¢(p) is con- 
tinuous in (p) at least if (p)(q) is so small that (p) determines a sequence of 
elementary ares. Hence 


9(q) S o(p) +e 


provided (p)(q) is sufficiently small. Statement (i) follows directly. 

We continue with a proof of (ii). 

(ii). A homotopic pseudo-extremal is a homotopic extremal. 

Let » be a semi-critical point in the space Q(a, b) at the level c. We shall 
show that F(n) = c. It follows from the lower semi-continuity of F that either 
F(m) = cor F(n) < ¢ — e& where e, > 0. If m is the final image of 7 under 4 
and F(n) < c¢ — e, it would follow with the aid of (14.9) that 


(14.10) F(u) S F(m) +e S F(n) te <c-—ate (An < 4). 
If 6 in (i) is sufficiently small e can be taken less than e; and (14.10) implies that 
F(u) <e — (a +e) <e (An < 4). 


We can thus F-deform the neighborhood \y < 6 of 7 onto a domain definitely 
below c. The limiting pair (7, c) must then be homotopically ordinary con- 
trary to hypothesis, and we conclude that (ii) is true. 

We conclude the proof of Theorem 14.3 by establishing (iii). 

(iii). A homotopic extremal is a metric extremal. 

To prove (iii) it will be sufficient to show that if 7 is metrically ordinary 1 
is homotopically ordinary. When 7 is metrically ordinary the m arcs into 
which 7 is initially divided in defining 6(n, m) are so chosen that at least one are 
is not elementary. It follows that F(m) < F(») and upon making use of the 
arbitrariness of e it follows from (14.9) that @ F-deforms a sufficiently small 
neighborhood of 7 onto a domain definitely below F(n). Thus 7 is homo- 
topically ordinary. 

The proof of Theorem 14.3 is complete. 


15. Properties of Q(a, b) and of metric extremals. In §9 we have defined 
locally F-connected spaces. We shall here prove the following theorem. 
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TuroreM 15.1. Under postulates 1-V the space Q(a, b) is locally F-connected 
for all orders n. 

Let be a curve of Q(a, b). To establish the theorem it will be sufficient to 
establish the following. 

(a). Corresponding to » and a positive constant e there exists a positive constant 5 
and a deformation of ns on ne tnto a curve m On 4. , in which any compact subset S 
of mon F < ¢ 2 F(n) ts continuously deformed on F S c + e. 

We shali apply the deformation 6(n, m) of the preceding section to 7. Recall 
that the definition of @ began with the division of » into m successive ares. 
If these ares are sufficiently small in diameter and V is a sufficiently small 
neighborhood of », @ will deform V on the e/2 neighborhood of ». We suppose 
sand V sochosen. Let A be a curve on V and let m and , be final images of 7 
and \ respectively under #. 

Each broken extremal \; which is sufficiently near 7; can be deformed into m 
as follows. The vertex q; of A; will be joined to the corresponding vertex p; 
of m by an elementary are E; , and as ¢ varies from 0 to 1, q; shall be replaced 
by a point p, which divides E; in the same ratio with respect to length as that 
in, which ¢ divides the interval (0, 1). The broken extremal }, will thereby be 
deformed into m. Denote this deformation by A. 

Let 6 be a positive constant such that ns < V. Let H be the final image 
of m; under 6. The diameter of H tends to 0 with 6. We suppose 6 so small 
that A deforms H on yn, andon F Sc+e. 

The deformation @ applied to 7; , followed by A applied to H, yields a deforma- 
tion of 4s which satisfies (a). Theorem 15.1 follows directly. 

Referring to Theorem 9.2 we infer the following corollary of Theorem 15.1. 

Corottary 15.1. Under postulates I-V the lengths of extremals of Q(a, 6) 
which correspond to inferior cycle limits of non-bounding k-cycles, have no finite 
cluster value. 

Analytic functionals. We shall give a brief indication of how the preceding 
theory can be applied to functionals which are analytically defined. We suppose 
that the underlying space M is compact, connected, locally analytic, and n-dimen- 
sional. To be locally analytic M shall satisfy the conditions of a space locally 
of class C* as defined as §9, together with the condition that the transforma- 
tions of codrdinates z; = z,(x) shall be defined by functions z,(x) which are 
analytic. In each local codrdinate system we suppose that we have given an 
invariant function f(z, 1) defining an integral 


J= | 1. é) dt 


asin M1, p. 111. We suppose that f(z, r) is analytic, positive, positive homo- 
geneous of order 1 in the variables (r), and positive regular. See M1, p. 121. 
We presuppose the analyticity of M and f(z, r) for the sake of simplicity, 
although it will be clear to the reader that less stringent conditions would suffice. 

As is well known there will exist a positive constant p with the following 
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properties. The extremals of J which issue from a point p of M and on which 
0 < J S pwill form a field covering a neighborhood of p in a one-to-one manner 
p alone excepted, with no conjugate points of p on the extremal ares 0 < J < . 
We introduce a metric N in which the distance pq equals the absolute minimum 
of J along rectifiable curves which join p to g. With N and p so defined, we 
proceed as in the abstract variational theory noting that conditions I-V are 
satisfied. 

As previously we are concerned with the space Q(a, b). The integral J defines 
“length” and leads as at the end of §12 to the functional F on Q(a, b). A 
metric extremal will here be locally regular and analytic, and will be termed a 
differential extremal. We shall prove the following theorem. 

THEOREM 15.2. Let n be a differential extremal joining a to b on which there 
are k conjugate points of a, but on which b ts not conjugate to a. The extremal g 
is a homotopic critical set whose j** type number is 6; . 

In proving this theorem we shall make use of a subset of curves of 2 termed 
canonical curves and defined as follows. Let 


(15.1) : G,Q,,°** , Ami, 


be a sequence of points on 7 which divide 7 into m successive elementary ares 
with diameters less than p/2. Let M* be a regular analytic (n — 1)-manifold 
cutting 7 at the point a, without being tangent to 7 at a,. Broken extremals 
of Q whose intermediate vertices lie on the successive manifolds M* will be 
termed canonical, and a set of all such curves with vertices on neighborhoods 
of the respective points a, will be denoted by S. The value of F on curves \ 
of S will be an analytic function g(u) of the local codrdinates of M neighboring 
the respective vertices a,. As shown in M1 (ef. p. 228) the point (u) = (w) 
which determines 7 is a non-degenerate critical point of g of index k. Upon 
referring to Theorem 10.4 we infer the truth of the following. 

(a). If S consists of canonical curves determined by points (u) sufficiently near 
(uo) the differential extremal n is an isolated homotopic extremal rel S, and the 
j* type number of » rel S equals 6}. 

The curve 7 is isolated among differential extremals on Q at the level ¢ = F(1). 
There are accordingly no homotopic extremals on @ at the level c in a suff- 
ciently small neighborhood of 7, 7 excepted as we shall see. 

We continue with a proof of statement (8). 

(8). Corresponding to the set S of (a) there exists a separate neighborhood U of 1 
on Q(a, b) which can be deformed into a subset of curves of S without increasing F 
beyond its initial value on each curve of U. 

We shall make use of the deformation 6(n, m) of the preceding section under- 
standing that the m arcs into which 7 is initially divided in defining 0(1, ™) 
are the m arcs whose vertices are given by (15.1). We take U so small that U 
is on the domain upon which 6(y, m) operates as an F-deformation. Let 4 
be a curve of U and u the final image of \ under 0(n, m). The curve » will be 
a broken extremal with intermediate vertices neighboring the respective points 
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a, of 7 If U is sufficiently small there will be a unique curve in S whose 
successive vertices lie on ». We denote this curve by ¢(u). 

The deformation Z. Under Z we shall deform yu into ¢(u). We thereby 
replace each intermediate vertex p of u by a variable vertex p,. We cause p 
to move from p along uw to the correspondingly numbered vertex r of ¢(u) in 
such fashion that the distance along » from p to p, changes at a rate equal 
to the distance along » between p and r. The curve uz is thereby deformed 
into ((u). Moreover F(f) S F(u). The deformation 6(y, m) followed by the 
deformation Z taken with a sufficiently small neighborhood U of 7 will 
satisfy (8). 

We continue with a proof of (y). 

(y). A j-cap u associated with n rel S is a j-cap associated with n rel Q. 

Since u is associated with 7 rel S, wu is c-homologous on S to a j-cap on an 
arbitrarily small neighborhood of 7, in particular to a j-cap v on U. If v were 
c-homologous to 0 on 2, v would be c-homologous to 0 on U, by virtue of Theorem 
10.1 (b). Let v* be the final image of v under the deformation D of (8). It is 
clear that v* is c-homologous to 0 on § if v is c-homologous to O on U. But 
v and v* are c-homologous on S if U is sufficiently small as one sees with the 
aid of a deformation similar to Z operating on the formal deformation 
chain defined by v under D holding v and v* fast. We infer that wu is c-homol- 
ogous to 0. From this contradiction we infer the truth of (7). 

We come finally to a proof of (6). 

(5). Any j-cap u associated with n rel Q is c-homologous on Q to a j-cap asso- 
ciated with n rel S. 

The k-cap u is c-homologous on Q to a k-cap on the neighborhood U. It 
follows from (8) that u is c-homologous on @ to a j-cap on S, and the proof 
of (6) is complete. 

We turn to the proof of the theorem. It follows from (vy) and (6) that a 
maximal group of j-caps associated with 7 rel S is a maximal group of j-caps 
associated with » rel 2. It then follows from (a) that the dimension of such 
a group is 6}. 

The proof of Theorem 15.2 is complete. 

With the aid of this theorem and theorems of a like nature the reader can 
make connections with the principal results in M1. 


Tue InstituTE For ADVANCED Srupy. 
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A NOTE ON A PREVIOUS PAPER “NEW FOUNDATIONS OF 
PROJECTIVE AND AFFINE GEOMETRY” 


By Franz Att AND Kart MENGER 


(Received January 25, 1937) 


1. This first paragraph is an erratum to the paper cited above. In §2, 
postulate IV (p. 458, referring to the symbol € which is only afterwards intro- 
duced, has to be replaced by the two propositions (a) and (a’) of p. 458. 
Nothing else has to be changed in §2 since it is these hypotheses which are 
actually used. In §3 the remark of p. 461 that law 5 is equivalent to IV must 
be replaced then, by the statement that law 5, though stronger than IVa and 
IVa’, still holds in projective as well as in affine spaces. 


2. The next two paragraphs are remarks by F. Alt on the paper cited above. 

It is obvious that law 5 implies [Va and IVa’. It may be shown by an 
example that the converse is not true. We consider a class which besides V 
and 9 points 1, 2, 3, 4, 1’, 2’, 3’, 4’, 0, contains 12 “lines” consisting of the points 
(0, 1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4), (0, V’, 2’), (1’, 3’), (1’, 4’), (2’, 3), 
(2’, 4’), (3’, 4’), and 8 “planes” consisting of the points (0, 1, 2, 3), (0, 1, 2, 4), 
(1, 3, 4), (2, 3, 4), (0, I’, 2’, 3’), (0, Y’, 2’, 4’), (1’, 3’, 4’), (2’, 3’, 4’) and U. 
For two elements A, B of this class we define A + B as the element of least 
dimension containing all the points of A and of B (e.g. we have 1 + 1’ = 
1 + 2’ = U), and A-B as the element consisting of the points contained as 
well in A asin B. Then the class satisfies Postulates I, II, III, IVa, IVa’, V, 
while law 5 does not hold. This is seen for A = 0, B = (0,1, 2), C = (0,1, 2, 3). 

Another remark may be added regarding this example: If we consider the 
subclass consisting of all parts of the plane (0, 1, 2, 3), i.e. V, 0, 1, 2, 3, (0, 1, 2), 
(1, 3), (2, 3), (0, 1, 2, 3) then we notice that this sub-class does not satisfy 
Postulate IVa. This is seen for A = (0, 1, 2), B = 0. On the contrary, 
classes which satisfy laws 1-6 are “hereditary’’, ie. for each element A the 
class of all parts of A satisfies laws 1-6 itself, A playing the réle of U. 


VIENNA 





1 Annals of Mathematics, vol. 37, (1936), pp. 456-482. 
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p-ADIC ANALYSIS AND ELEMENTARY NUMBER THEORY 
By Max Zorn* 
(Received May 1, 1936) 


Introduction. In 1933 I. Schur’ communicated a far reaching generalization 
of Fermat’s theorem in elementary number theory. He introduced as the 
derivative a, of a sequence a, with respect to the number p the sequence 


, Qn+1 — An 
(0, 1] an = pet Fi ° 
If p is a rational prime, a an integer, and a, = a’”, then by Fermat’s theorem 
. -" es a” 
(0, 2) an = pt 


is integral. 

Schur proved the generalization: Not only the first derivative (0, 2), but also 
the following derivatives up to the (p — 1)** of the sequence a”” are integral, if a is 
prime to p. , 

His proofs however are not very simple. It is easy to see that the second 
derivative is integral (cf. (4.9)); but my first attempts to generalize the pro- 
cedure in this case were not successful. 

Later, Mr. Rothgiesser gave an inductive proof for all the statements in 
Schur’s paper.” 

The present paper is concerned with the discussion of Schur’s theory. 

First of all I call attention to the fact that this theory is valid in the p-adic 
field belonging to the prime number p and that the “p-adic” theorem implies 
again the “rational” one. 

From the definition of limit in the p-adic sense we see that the sequence a, is 
certainly convergent if the derived sequence a, is integral. Hence the Fermat- 
Schur theorem establishes a series of functions of the p-adic number a, namely 
the limits 
(0, 3] D(a) = lim (a”)"”. 


nwo 
. 


It turns out that it is much more convenient to study the related sequence 


—1) p” 
(p—1) p — 


(0,4) La) = 


and its derivatives. It is easy to return to the original sequence a” (cf. §8). 
eA SUR ett | 

. Sterling Fellow. Presented to the A. M. S. in October 1935. 

' Sitzungsberichte der Preuss. Ak. d. Wiss., Math. N. K1., 1933, p. 145. 

? Letter to Prof. Schur., Sept. 9, 1933. 
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We show that the limits of the derivatives of [0, 4] exist and are given by the 
formula 


i iy _ (p' — 1) (p"' — 1) --- (p—1) ( lag ea 
Oo (7 +1)! > ; 


Here the function log a” " may be defined either by the power series for the 
log or by (0, 5) itself for r = 0: 














p—l 
(0, 6) fie dahed ie ee 
(0, 5) has its counterpart in real analysis: 
da’ 
z ‘al ] r 
(0, 7) | oe (a | (log a) 
which for r = 1 contains 
i 
e ~1 
(0, 8) lim —>— = log a. 


n 


But the integrity of a, means more than simple p-adic convergence of Gy; it 
gives information about the approach to the limit, or in other words congruence 
relations for the a,. We obtain a system of congruences, which generalize 
Schur’s results. I mention here only the 

TuHeorEM. If pis odd and x = 1 (mod. p), and 


xz? —1 
prt , 


[0, 9] xX.= 


then 





oy — (p’ — 1) (p™" — 1) -:- (p— 1) pent ® 
(0, 10) xX,’ = Ga! X*, (mod. p”) 
forr < p;forr < p — 2 the congruences are valid mod. p”™’. 

The arrangement given in this paper is not intended to be a systematic simpli- 
fication of Schur’s theory. Even the new congruence (0, 10) would be only a 
formal justification, since doubtless there will be a satisfactory proof within the 
framework of the usual congruence arithmetics. 

The interpretation in p-adic analysis shows the possibility of vast generaliza- 
tions and of a direct way to find them. The general principle of this method 
would be: 

Ordinary analysis has amassed a great stock of identities between power series. 
Many of these are valid in p-adic analysis too. But here an identity between power 
series yields congruences between partial sums.° 





* Cf. Eisenstein, Crelle, vol. 39, page 231. 
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In our case the basic identity is the addition theorem of the logarithm, hence, 
in the last instance, Abel’s theorem. 

The next step in this direction would be an investigation of elliptic functions, 
where I expect relationships with the theory of complex multiplication. 

Finally, with a certain hesitation, I shall point out another reason for the 
desirability of a closer study of special p-adic functions. 

Various facts in the theory of algebraic numbers indicate the possibility of an 
“analytic continuation” of p-adic numbers. If such a “systematic identification 
of (algebraic and transcendental) p-adic numbers in different prime spots” 
exists at all, then probably the values of an analytic function, e.g. 





1 > (1 att 


loga = 
og i—>74 


for a fized a would form a “monogenic system of p-adic numbers.” 

In this way the study of special p-adic functions might furnish material for 
later structural analysis. 

Whether or not such monogenic systems of p-adic numbers do permit an 
intrinsic description is certainly an important problem. The relation to number 
theory proper is suggested by the formula 


p-l 


Legendre symbol (:) = lima?” 


ne 


n 


The law of reciprocity, which describes the dependence of this function on p, 
has long been considered as an analogue or as a manifestation of an analogue of 
the Cauchy integral theorem.*'® 


1. The results of I. Schur. 

[1, 1] Derinrrron: Given a sequence of numbers a, a1, --- @n--- and a 
number p ~ 0. Then the (Schur-)derivative Aa, of this sequence is another 
sequence 
On+1 — An 


/ 
a, = Ad, = pen 


Higher derivatives are defined as usual: 
A’a, = aS” = A(a’””) = A(A”“a,). 


(1,2) Fermat’s Tozorem. If a is an integer and p a prime, then 
n+1 n 
n a” = a? 
Aa” = pen 
consists of integral numbers. 
LG 
‘Cf. Hilbert, Mathematische Probleme, Coll. Pap. vol. III., page 313. 


SA, Weil located several years ago the analogue more precisely. It turns out that 
Riemann’s bilinear period relation is the exact correspondent of the law of reciprocity. 
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We show that the limits of the derivatives of (0, 4] exist and are given by the 
formula 





sill saataes (+i)! Pm ; 


Here the function log a” * may be defined either by the power series for the 
log or by (0, 5) itself for r = 0: 








p—l 
(0, 6) fen bodes, we Oe. 
(0, 5) has its counterpart in real analysis: 
dad’ 
z _ ] r 
(0,7) | = (a 12 (log a) 
which for r = 1 contains 
J 
a -1 
(0, 8) lim | = log a. 
i's 


But the integrity of a, means more than simple p-adic convergence of ay; it 
gives information about the approach to the limit, or in other words congruence 
relations for the a,. We obtain a system of congruences, which generalize 
Schur’s results. I mention here only the 

THEeorEM. If pis odd and x = 1 (mod. :p), and 


x? —1 
pr ? 


[0, 9] xX.= 


then 





o — (p — 1) (p" — 1) +++ (P—D pet ® 
(0,10) x=? Res =H ud X7** (mod. p") 
forr < p;forr < p — 2 the congruences are valid mod. p”™’. 

The arrangement given in this paper is not intended to be a systematic simpli- 
fication of Schur’s theory. Even the new congruence (0, 10) would be only a 
formal justification, since doubtless there will be a satisfactory proof within the 
framework of the usual congruence arithmetics. 

The interpretation in p-adic analysis shows the possibility of vast generaliza- 
tions and of a direct way to find them. The general principle of this method 
would be: 

Ordinary analysis has amassed a great stock of identities between power series. 
Many of these are valid in p-adic analysis too. But here an identity between power 
series ytelds congruences between partial sums.* 





* Cf. Eisenstein, Crelle, vol. 39, page 231. 
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In our case the basic identity is the addition theorem of the logarithm, hence, 
in the last instance, Abel’s theorem. 

The next step in this direction would be an investigation of elliptic functions, 
where I expect relationships with the theory of complex multiplication. 

Finally, with a certain hesitation, I shall point out another reason for the 
desirability of a closer study of special p-adic functions. 

Various facts in the theory of algebraic numbers indicate the possibility of an 
“analytic continuation” of p-adic numbers. If such a “systematic identification 
of (algebraic and transcendental) p-adic numbers in different prime spots”’ 
exists at all, then probably the values of an analytic function, e.g. 


re) 


a. (1 — a”)" 
loga = -—— = 





for a fixed a would form a “monogenic system of p-adic numbers.” 

In this way the study of special p-adic functions might furnish material for 
later structural analysis. 

Whether or not such monogenic systems of p-adic numbers do permit an 
intrinsic description is certainly an important problem. The relation to number 
theory proper is suggested by the formula 


p-l 


Legendre symbol (*) = lima?” 


neo 


n 





The law of reciprocity, which describes the dependence of this function on p, 
has long been considered as an analogue or as a manifestation of an analogue of 
the Cauchy integral theorem.*'® 


1. The results of I. Schur. 

(1, 1] Derinrrion: Given a sequence of numbers a, a1, +++ @n--+ and a 
number p ~ 0. Then the (Schur-)derivative Aa, of this sequence is another 
sequence 
Qn+1 — An 


Uy 
Gn - Aa, _ we 


Higher derivatives are defined as usual: 
A’an = aS” = A(a’’”) = A(A”a,). 


(1,2) Fermat’s Tuzorem. [If a is an integer and p a prime, then 


n+1 n 
Aa” al a? a a? 
_ prtt 


consists of integral numbers. 
SES 
‘Cf. Hilbert, Mathematische Probleme, Coll. Pap. vol. III., page 313. 


*A. Weil located several years ago the analogue more precisely. It turns out that 


Riemann’s bilinear period relation is the exact correspondent of the law of reciprocity. 
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We shall say “Aa’” is integral.” 
This interpretation of Fermat’s theorem is justified by the 


(1, 3) THrorEM or Scour. Jf a is an integer and p a prime not dividing a, 


then not only Aa””, but also the following derivatives 
ae ae a 
? 


up to the (p — 1)* are integral. 

From now on we shall confine ourselves (in the whole paper) to odd primes, 
for the sake of simplicity. This evidently does not affect (1, 3). 

(1, 4) Turorem (Schur): If under the assumptions of (1, 3) 


pal 
a? 


—]1 
—————. = 0 (mod. p) 
D Pp 


then every derivative A'a”” is integral. If 


| 
a” 


—— # 0 (mod. p) 


then (any number of) A’a”” has exactly the denominator p. 

We suppose the reader familiar with the elementary properties of p-adic 
numbers and state without proof 

(1,5) The theorems (1, 2)—(1, 4) are still true if we allow a to be a p-adic number, 
the terms “integral,” “divide” and “prime to” given their p-adic meaning. 

Moreover we see 

(1, 6) If (1, 2)-(1, 4) are true in the p-adic sense, then the original statements of 
Schur are true. 

This is due to the fact that for rational integers a the denominators of the 
derivatives are trivially powers of p. 


2. Preliminary p-adic interpretation. 

(2, 1) If Aa, is integral then the sequence ap ts convergent and the limit 1s congruent 
to d (mod. p). 

Of course a, would already be convergent if Aa, had bounded denominators, 
and this would still be a very particular and strong form of convergence. (Cf. 
the relationship between continuity and differentiability of functions). 

Fermat’s theorem (1, 2) consequently implies the convergence of a” for each 
p-adic integer a. We define 
[2, 2] x(a) = lim a” 


and state 
(2, 3) The function x(a), defined for any p-adic integer a, 


a) is multiplicative, x(a)x(b) = x(ab) 

B) satisfies x(a)”* = 1, if (a, p) = 1, 

Y) satisfies x(a) = 0, if a = 0 (mod. p), 

5) makes x(a) = x(b), only if a = b (mod. p). 
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Incidentally we have the formula 


p-l p—1 


+r ae 
is ) = lima = Legendre symbol (°), 


Up to this point we have been restating known facts; the following inter- 
pretation concerning the second derivative is new and as matter of fact was the 
point of departure for the further investigations. 

Instead of studying Aa” directly we consider 

gg? _ 4 


(2, 4] L,(a) _ 


pt 
(2, 5) The formula 
Aa” = a?” - L,(a) 
shows the connection; in particular L,(a) has the same denominator as Aa”’, if a is 
prime to p. 

Now we know from (1, 2) that in case (a, p) = 1, Aa” has an integral deriva- 
tive, hence from (2, 1) that it converges; a” converges on account of (2, 3)8) toa 
(p — 1)* root of unity, in particular not to 0; consequently 
(2,6) L,(a) converges and thus defines for (a, p) = 1 a function L(a): 
gq’? oS 


[2, 7] L(a) = lim ———. } 


n-»00 prt 
This function is except for a factor the p-adic logarithm of a. 
(2, 8) L(ab) = L(a) + L(0). 
The proof is easy: if a = b = 1 (mod. m), then 


ab-1 _a-1,b-1 ; 
= + - (mod. m); 














™m m 
now we have 
gq?) ss HP)" = 1 (mod. p™), 
hence 
(p—1) p™ — pasar 
: ss : + e _ -= oo = (mod. p*") 
or L,(a) + Ly(b) = Lq(ab) (mod. p”™); 


n— © yields the theorem. 

Two procedures are now possible. Either one develops the properties of 
L(a) and the analogue of the exponential function from this definition ((2, 7]). 
This would correspond in real analysis to basing the theory on (0, 8) and 


exp z = im (1 +2) ‘ 


no 
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Or the one introduces the functions log and exp by means of power series; the 
standard functional relations are easily derived therefrom. Based on these and 
the power series developments the properties of not we | L(a) and L,(a), but 
also of A”L,(a) and finally even the properties of A’a”’.can be obtained by 
almost straightforward computation. 


3. The power series for log and exp. In this paragraph we shall collect the 
simplest standard properties of the exponential and logarithmic functions in the 
p-adic case, based as usual’ on the power series 





(3, 1] expz = 2) - 
= vi 
[3, 2] log z = — > ap. 
=] 
(3, 3) ToErorEm: [3, 1] has a sense (i.e. the series converges) 
for z = 0 (mod. p); 


[3, 2] has a sense for x = 1 (mod. p). 
(3, 4) THEOREM: a) exp (2: + 22) = (exp 21)(exp 22) 
B) exp z = 1 (mod. p). 


(We use the letter z for numbers = 0 (mod. p) and letters x for numbers = | 


(mod. p).) 
(3, 5) THrorEM: a) log (x22) = log 1 + log x2 


B) log x = 0 (mod. p). 
(3, 6) THEorem: If p is odd and 
= 1 (mod. p’), a # 1 (mod. p’”), 
then 
log x = x — 1 (mod. p”) 
# x — 1 (mod. p”"’). 
Indeed 


logx—-(t#-1)=-(1- Dp te 


oo 


>> is congruent (mod. p) to its first member, 3; hence the difference is exactly 


v=2 


divisible by p”. 





6 Cf. Hensel, Zahlentheorie, p. 131 et seq. 
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Finally since log x = 0 (mod. p), exp z = 1 (mod. p), the expressions exp 
(log z) and log (exp 2) have a sense, and satisfy as in real analysis the 
(3, 7) THEOREM: 


exp (log x) = x 
log (exp z) = z. 
This implies for instance that log x = 0 only for x = 1; this follows also from 


(3, 6). 

The statements of the following paragraphs will be proved on the basis of 
(3, 1] — (3, 7) alone, whereas in §§1, 2 we assumed for purposes of analysis and 
explanation some of Schur’s results. 


4. Computation of L(a). If 2 = 1 (mod. p), then 2” = 1 (mod. p™”). 
Hence by the first part of (3, 6) we get 


(4, 1) log 2” = x” — 1 (mod. p’™””). 
If we use the functional equation of the log, we obtain 
(4, 2) p” log x = 2” — 1 (mod. p’”*”). 


We may divide this (ie. both sides and the modulus) by p”*’; we retain 


integral numbers, since log = 0 (mod. p), and x” — 1 = 0 (mod. p”*’), hence: 


logx 2” —-1 
: = ra (mod. p 


Now in (4, 3) the lefthand member is independent of n; consequently we may 
assert 


(4,3) THEOREM: ae 





Pa. 
(4,4) THEOREM: as lim int 
p p” 
Note that in practically the same fashion one could derive (2,9) from the 
definition [3, 2], in the real field at least, for | z —1| < 1. 
Furthermore (4, 3) includes the congruence 


prtl 


pt 


p”™ 


— 1 - 
pa (mod. p”*’) 





or, returning to the A-terminology 





(4,6) TazorEm: A (Z = ‘) is integral. 
ptt 
Let (a, p) = 1, a” "* = a, therefore = 1 (mod. p); then from the definition 
(2, 4] of L,(a) follows the 
(4,7) Tuzorem. AL, (a) is integral. 
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From (4, 4) arises (ef. [2, 7]) 
, ] sie 
(4,8) THEOREM: Lie) = tn ie) oe 


There is also a more elementary proof for (4,6) and (4,7), based on the 
formula 


(4, 9) gr os (x?")? a (1 rs p” X.)” 
where X,, is defined by 

2” —1 
[4, 10] Ma => pet 


Applying the binomial theorem one obtains 


p 
(4, 11) gn wt} «@ - (?) * xX’ 


v=1 v 


and 


Pp 
Xn = Be ot »» (?) a (v—2)—1 X?; 


v=2 


hence for AX, 
(4, 12) THEoREM: 


- Pp (n+1) (v—2)—-1 yy 
AX, = >> p Ans 


v=2 \V 
Pp 
The members of the >> are integral, since in the only dubious case v = 2, 
y=2 


(?) = 0 ( mod. p) by virtue of p > 2. This proves (4, 6) and (4, 7). 


But (4, 12) carries more information; it yields for p > 3 a simple congruence 
for AX, ; the identity (4, 12) in the form 


_ [P\1 ye nti Ce (P\ (nt 6-3-1 ye 
(4, 13) AX, = 9 oe te aes ,)P ae 


v=3 


shows for p > 3 the 





(4,14) THEOREM: AX, =? ™ l X?, (mod. p”*’). 


This is a considerable refinement of (4,6). Indeed it contains the fact that 
the difference AX,4: — AX, is divisible by p”™’, hence—if p > 3— 
(4,15) Tueorem. A’X, is integral. 
It is possible to proceed further in this direction; this would mean an extension 
of Rothgiesser’s proof to the congruences of type (4, 14). I hope to be able to 
discuss this on another occasion. 








(5 
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In the following paragraph we shall give a p-adic development for X,,, which 
is particularly suited to the Schur-derivation. 


5, p-adic representation of X, and A’X,. We shall now develop X, in a 
power series with respect to (log x)/p; the Schur derivatives may then be formed 
term by term, which yields a similar development for all A’X,. 

If we want to return from equations in p-adic analysis to congruences mod. p”, 
we have only to break off the power series concerned. If one prefers to avoid 
the transcendental function log, one has to replace (log x)/p, (mod. p"*") by X,. 











. —s 1 " 
X. = an re tom age”) ~ 41 
(5, 1) 1 ce} (p” log x)” co) nv—n—l 
ae Hs Ae - 
+ ae: 7 =) = (log x)’; 


this we shall mainly use in the form 


(5,2) THEOREM: 
( a 
> ge (v—1) Pp 


v=] Vv ! 


In order to get AX, and A’X, we have only to compute 


1 —] 
A’ p"* & a 


The formula 
(5,3) A’ ape (y—1) a are sili | p _ 1) lia ee 1) ae (p’ es 1) 


is easily verified by induction. 
Here the first factor p°—”? ‘**? 

case one of the remaining factors vanishes, namely p 

(5,4) Tozorem. The first r members of (5,3), considered as a sequence in v 

(i.e. for vy = 1,---r) are zero. The others are exactly divisible by on, 
Hence in the following formula for 


is fractional whenever v < r + 1; but in this 


v—(r—(v—r)) , 





(* sy 
A’ Xx. ai > (A’ ae a P 


v=] v ! 


we could make the summation begin from vy = r + 1: 
(5,5) Taroren: 


r(x” —1 r 2. (pt) (nti) (p"" —1)::: (p”* — 1) flog x\’ 
(7) -ex= Er = Pp : 





This formula (5, 5) is the key for all further statements about the A’X,. 
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6. Properties of A’X,. We are of course interested in the denominators of 
the A’X,, but denominators will arise only from the factors 1/v!_ Now p’/y! is 
always integral. Consequently, if 


log x 


log « = 0 (mod. p), or = 0 (mod. Pp), 


we are sure that + (82) is integral. Since in this case every term of > 
v 


5 v=] 
in (5, 5) is integral, A’X, itself turns out to be integral. Now we know from 
(3, 6) that this favourable situation arises if and only if z = 1 (mod. p’): 
(6,1) Tazorem. If x = 1 (mod. p’), then all A’X, are integral. 

It is further important that the first non-vanishing term in (5, 5) does not 
depend on n. We may write 


AX, = @-})-- (p’ —1) ("s a)" 
(r+ 1)! p 
(6, 2) P . 
a. Fgh owe (p”” —1)-- 


v=r+2 v ! 








i (p”* om 1) ("2 y 
p ) 


From this we will conclude that A"X, is congruent to the first term (which is, 
in general, fractional) modulo any power p™ of p, provided that the index n is 
greater than a number N(r, M) which depends on the order of derivation r and 
on the closeness p™ of approximation wanted. 

We determine N in such a way that, given r and M 


” ite (n+1) 
wae” Tih =0 (mod. p”) 


forv-—-r-—-1>0. 
Since p’/v ! is integral, it is sufficient to have 


p’? (™4-" = 9 (mod. p™) 
or 

(—r-In+1)—»—-M Z0 
for 


y—r-—1>0, n>wN. 


A possible determination is N = M + r+ 1; consequently: 
(6,3) THrorremM. For any M = 0 andr we have 


—_— eee ¥ _ r+l 
aX, = & - 7 nd ) (* *) (mod. p™) 





provided thatn > M +r. 





A fF r= oo 
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This statement, obtained by a very rough estimate, yields nevertheless a 
considerable amount of information, e.g. 
(6, 4) THEOREM. Any sequence A'X,, is convergent, and for the limit we have the 


equation 





; — = Pay) (lez) 
a eae p) 


Also 
(6,5) TxorEM. In case x # 1 (mod. p’) the denominator of A'X, forn > r is 
exactly the denominator of 1/(r + 1)!. In particular for r < p — 1,A°X, is 
integral, and for r = p — 1 the exact denominator is p. 
As a matter of fact the latter part of (6, 5) concerning r < p — landr = p — 1 
holds without restriction on n: 
2” —1 


eat Saye mt Gand 9), p > 8. 


(6,6) Martyn THEorEM. Let 
Then 
a) if x = 1 (mod. p’), every A’X, is integral; 
8) if x # 1 (mod. p’), A? *X, has denominator p; 
vy) AXn, --- , A” °X, are always integral. 


a) has already been proved (v. (6, 1)); the proof for 8) and y) consists in estab- 
lishing the integrity of the members in (cf. (6, 2)): 


© = (prt) (n-+1) y 
— ne log x 
P (p = 1) + =» (8), 


(6,7) p 2 


v=r+2 v! 
We prove now the 
(6,8) THzorem. p°~"™?"*” /y! is integral for v —r —1>0,r < p. 


If vy = w+ mp + --- + u,p’, then v! contains exactly p“'* ““* *“*; hence we 
have to confirm 


(v—r—1)(n+1)—-(mt+---+4u,) 20. 
Apparently it is sufficient to prove (6, 8) for n = 0: 
(yv—-r—1)—(m+---u) 20 
or 
(6, 9) Ww + u(p—1)+---+u(p’ —1) =>rtl. 


a) For» < p (6, 9) means uw >r+ 1, and this follows from m# = v > r+ 1. 

b) For y = p (6, 9) means p — 1 >r + 1; this follows again from p = vy > r + 1. 
¢) For » > p the left-hand member of (6, 9) is at least p, and this is = r + 1 by 
virtue of the restriction on r. 
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Finally we obtain a theorem about any a which is prime to p by putting 


—1 
z=a’. 


(6,10) THrorem: Let p > 2, (a, p) = 1, L,(a) = 
Then 


a) if a” * = 1 (mod. p’), every A"L,(a) is integral; 
8) if a” ¥ 1 (mod. p’), A? “L,,(a) has the denominator p; 
vy) AL, (a), «+ , A” *L»(a) are always integral. 


gg’? wet 


prt 


7. Congruences mod. p” for A"X,. In order to obtain sharper results we 
need only scrutinize our proofs concerning the members of (6, 7) in (6, 2). 

First of all we considered p” ""/v! alone, disregarding a factor p°~"" 
which is at least p". This is necessary if we do not want the modulus to depend 
on n. Otherwise we might have stated that A’X, is congruent to its limit 
(cf. (6, 4)) modulo p” in all cases covered by (6, 6) or (6, 10). 

On the other hand we did not investigate the actual p-exponent of (6, 7). 
We shall ask now in this direction: When is p” "'/v!, which we know to be 
integral (for »y > r + landr < p), not divisible by p? 

In this case we should have 


(7, 1) u +u(p—1)+---+u(p’—1)=r+1 
where 

(7, 2) yv=u+tupt:---+up>rt+i, o & i. 
(7, 3) r+1sp. 


a) v < p, we have uw = v, u; = Ofori ~ 0. 
Since v > r + 1, (7, 1) cannot be true. 

b) If v = p, then (7, 1) givesp -1=r+lorr=p—2,v=r+2. 
In this case, then, the first member of (6, 7) is exactly divisible by p”, provided 
that the logarithmic factor is prime to p. The other members are divisible by 
p”"’, hence the infinite sum (6, 7) is also exactly divisible by p”. 

c) If » > p we know that 


U+t+upt+---+up' > p 
and 
w+tu(p—1)+---+u(p’-—-1lh=r+1 Sp. 
The second inequality can be replaced by the equality by virtue of the first; 


pri 


is not divisible by p 





we obtain r = p— 1,» =p+1. In other words 7 
Vv. 


exactly when (r = p — 1)v=r+2. 


Again we see that A’X,, in general will be congruent to its limit exactly modulo p”. 
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We collect these results in the 
(7,4) Man THEOREM: 














a ee — ] r+l1 . 
. oX, 2 OD —D (1082) (mod. 9) 
forr = 1,2,---p — 3. 
ry — (p — 1) ++: (p — 1) (log xz\™ . 
8) AX, = (+i)! ; (mod. p”) 
forr=p-—2,p-—1. 
Fw 2 ae na l r+1 ss 
) ox, 2D aD (82) (mod. p* 
ifr=p—2,p—1, and 108 * 44 © (mod. 9). 


Finally we write (7,4) without logarithms, replacing (log x)/p, modulo 
vo” by Xx (ef. (4, 31)). 
(7,5) THEOREM: 











r sai (p" a 1) = (p rt 1) r+1 n+1 
a) A’X, = @ 1)! X’,” (mod. p”™) 
jforr = 1,2, --- p — 3; 
r _ ip — 1) ->- @— 1) r+1 n 
8) Ax, = G+)! Xi,” (mod. p”), 
forr=p—2,p—1; 
r (p" oad 1)  f (p = 1) r+1 n+l 
7) A’X, # GED! X’, (mod. p”"’) 


forr=p—2,p—1 and X, #0 (mod. p). 
8. Proof of Schur’s statements about A’a””. As stated before in §2 we get 
information about A’a”” from 
(2, 5) Aa” = a’’.L,(a). 


Evidently we need formulas which connect the derivative A(anbn) Of nbn 
with the derivatives Aa, and Ab,. We state these rules in the 
(8,1) TaHrorEen. 


a) A(dn + bn) = Aan + Abn 
B) A(anbn) _ anAbn + (Adn)Dn4i 
v) A(can) = cAan. 
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Now let us take the derivative of (2, 5): 
A’a”" = a” AL, (a) + (Aa””’)-Lny:(a) 
= a” (AL»(a) + Ln(a)Ln41(a)). 


(8, 2) 


By virtue of (6, 10) we get for p > 2, (a, p) = 1, the 
(8,3) TurorEem: A’a”” is integral. 

It is possible to derive more precise congruences for these and higher deriva- 
tives. This will be done in another paper. Here we content ourselves with the 
integrity of the first (p — 1) derivatives and the denominator for the p“ deriva- 
tive. For this purpose we do not need the precise expression of the higher 
derivatives in terms of the A’Ln,,. We use only 


(8, 4) A’a” = a’"(A”"L, (a) + P) 


where P is a polynomial with integral coefficients in the A’L,,,(a) with order of 
derivation v less than r — 1. 

This is easily proved by induction. 

Hence if AL, (a), --- ,A” “L,(a) are integral for any n, then A‘a”” is integral 
too. Moreover, under these assumptions A’™’a”” differs from a”"A"L,(a) only by 
an integral number, and thus these numbers have both the same denominator. 
This finishes the proof of Schur’s main theorem about the A‘a’”, since the 
A’L,(a) occurring in their representation according to (8, 4) are sufficiently 
known from (6, 10): 

(8,5) Marin THEorEM oF Scuur: If p is a rational prime > 2, a an integer 
prime to p, then 


a) Aa’, ... , A” "a" are always integral; 

8) if a”* = 1 (mod. p’), then all A’a”” are integral; 

y) if a” # 1 (mod. p’), then A?a” has denominator p. 
The statement holds in the rational or in the p-adic field. 





YALE UNIVERSITY. 
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COLUMN NORMAL MATRIC POLYNOMIALS 
By Merritt M. FiLoop 
(Received November 10, 1936) 


The principal questions answered in this paper are: 

I. What is the greatest lower bound for d[PX] when P is a given matric 
polynomial, X is a matric polynomial of prescribed rank, and d[PX] is the degree 
of the product PX? 

II. How can the degree invariants of the column vector space of C,[P] be 
expressed in terms of the degree invariants of the column vector space of P, 
where P is a given matric polynomial and C,[P] is its r** compound? 

III. How can the problem of finding the matric roots of a matric polynomial P 
be reduced to the consideration of regular matric polynomials? 


1. Matric polynomials. Let e?7 be the matrix with p rows and q columns, 

all of whose elements are zero except the one in the r row and s* column which 
Pp q 

is unity. The matrix A”"[A] = >> >> A™[A]e?2, where A” [A] are polynomials 


r=1 s=1 


whose coefficients and variable \ lie in the complex field, may be written in the 
form A?{\] = >, A”, \* where A™®, # 0. The matrix A?*[A] is called a matric 
k=0 


polynomial of order [p, q] and degree n with coefficients A”, over the complex field. 

It will be convenient to denote the degree of A”“[A] by d[A”{A]], its rank by 
pl[A™[\]], and its transpose by A’™{\]. Furthermore, A”“[A] will be called 
regular if p[A”"[d]] = min [p, q], square if p = q, and elementary if [A”[\]]* is 
also a matric polynomial. 


2. Column normality. For simplicity, set P = A”[A], p = p[A”* [AJ], 
m= — p, & = ey, and d{P] = d[Pe] for (k = 1, 2,---,q). The vector 
polynomials' Pe, for (k = 1, 2,---, q) determine an integral set’ W[P] of 
rank p. The matric polynomial A”{\] is said to be column normal if and only 
if it satisfies the following three conditions: (i) Pe, = 0 for (k = 1, 2, --- , m), 
Gi) dmii[P] S dnmyo[P] S --- S dP], and (iii) Pe, for (k = m + 1, m + 2, 

‘+, @) form a normal basis’ for W[P]. The integers d;[P] for (k = m + 1, 
m-+ 2,---,q) are the degree invariants’ of W[P] and are called the column 
moariants of A?*[\]. It is convenient to set d,{[P] = 0 for (k = 1, 2, --- , m). 
(Similarly, P is said to be row normal if P’ is column normal and its row invariants 
are the column invariants of P’.) 


a 





‘Wedderburn, J. H. M.: Lectures on M atrices, p. 47-49. 
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It follows immediately’ that an elementary polynomial E% [)] always exists 
such that N?{A] = A“[AJE™[A] is column normal. Of course, EZ“ [A] is not 
unique but any polynomial such as N*“[A] is called a column normal form of 
A*{y]. 


3. THzorem I. (1) If P = P’[\) and X = X*[A] are matric polynomials, 
then d{PX] = dyx[P]. (2) If P is a matric polynomial of order [p, q| and j and 
k are integers such that 0 < j S qandk 2 j, then a matric polynomial X of order 
[q, k] and rank j exists such that d[PX] = d,{P]. 

Proor. (1) Let N = PE be column normal where E is elementary and set 
L = E'X. If d{[NL] < dN] it follows immediately’ that L’ef} = 0 for 
(k = p{L], p[L] + 1, --: ,q) which is clearly impossible since p[L] = p|X]. 
Hence d[NL] = dp [N] or d[PX] = dyjx[P]. (2) Let N = PE be column 

2 


normal where E is elementary and set M = a. Obviously X = EM satisfies 


a=1 


the required conditions. 


4. Degree invariants of compounds.’ If P is a matric polynomial of order 
[p, q] and E is elementary then C,[PE] = C,[P]C,[E]. Since C,[E] is elementary 
it follows that the column invariants of C,[PE] are the same as those for C,[P] 
and so it is sufficient to consider the case in which P is a column normal matric 
polynomial. In fact, it will be convenient to ignore the first ¢ — p[P] columns 
of P and consider the case of a regular column normal matric polynomial, since 
this obviously involves no loss of generality. 

The lemma which follows is stated without proof and might have been taken 
as the definition of a regular column normal matric polynomial. 

Lemma A. The regular matric polynomial N = N?*[)] is column normal if and 
only if it satisfies the following conditions: (i) q = p[N], (ii) di S$ d2 S --- Sd, 
and (iii) the product Q = ND is proper (leading coefficient regular), where 


q 
D= 2 ett i = =—d =d{N], and d& = d[Nefi]. 
=1 
Now consider the relation C,[Q] = C,[N]C,[D] where C,[D] is clearly a diagonal 


matric polynomial with powers of \ in the diagonal. Let the compound be 
6 


defined so that C,[D] = >> n**ett where @ = (2) and Bi = Be =>--- =o. The 


k=1 
8; are the various sums of the integers d — di, d — dz, --- ,d — dy taken ata 
q 
time, and in particular 6p = >) (d—d;,). Hence, if Q, = C,[Q]/d” and D, = 
k=q—r+1 


C,[D]/X”* , then Q, = C,[N]D, and C,[N] is column normal since Q, is proper. 
Let f; be the column invariants of C,[N] and set a, = B, — Be for (k = 1,2, 





2 Tbid. p. 49. 
3 Tbid. p. 64. 
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...,6). Then fi = d{(C,[N]] — ax but d[Q,] = d[C,[N]] = d[C[Q]] — Be = 
rd(N[ — Be and sof, = rd — B. This proves 


TuzoreM II. If P is a matric polynomial of rank p with column invariants 
d,for (k = 1,2, --- , p), ofr ts an integer such that0 <r S p, if yx are the various 
sums of the integers dy, dz, --- , dp taken r ata time and such thaty, S y2 S --- Sy 
where 0 = (7), and af fx for ts = 1,2, --- , 6] are the column invariants of C,[P); 


then f, = Ye- 


5. ToeorEM III. The left zeros of a matric polynomial P of nullity n are the 
same as those of the regular polynomial obtained from any column normal form of P 
by omitting the first n columns. 

Proor. (i) Suppose that R is a left zero of P. By the remainder theorem‘ it 
follows that P = [A — R]JQ where Q is a matric polynomial. If N = PE isa 
column normal form of P then R is a left zero of N since N = [A — R][QE). 
The first n columns of N are zero so necessarily the first n columns of QE are 
likewise zero. If No is the polynomial obtained from N by omitting its first n 
columns then clearly R is a left zero of No since No = [A — R][QE]o where [QE] is 
the polynomial obtained from QE by omitting its first n columns. 

(ii) Conversely, if R is a left zero of No it is obviously a zero of N and hence of 
P so the proof is complete. 


This result reduces the problem of finding left zeros of matric polynomials to 
the consideration of regular column normal matric polynomials. 


6. Further results. 

Corotuary 1.1. If P is a non-singular matric polynomial whose greatest 
column invariant is dq then there exists a matric polynomial R such that PR is 
proper and of degree dg. Furthermore, d[PR] = d, for every non-singular matric 
polynomial R. 

Proor. This corollary is an immediate consequence of Theorem I and 
Lemma A. 


Corotiary II.1. If N is a non-singular column normal matric polynomial of 
order n then the adjoint of N, with its rows in reverse order, is row normal. 

Proor. Since [adjN]’ = C,_,[N] it follows from the proof of Theorem II 
that [adjN]’ is column normal if its columns are properly rearranged. However, 
it is obviously only necessary to invert the order of the rows and of the columns 
of D,1 to make 6; = B, > --- = B, in this case and hence only necessary to 


— “4 the columns of [adjN |’; which has the same effect as inverting the rows of 
adjN 


es 


‘ Thid. p. 22, 
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Corouuary II.2. If P is a non-singular matric polynomial of order n with 


column invariants d, jor [k = 1, 2, --- , n] then the row invariants of its adjoint are 
fi = he i.| —_ dn—z41 for [k = . 2, adie n\. 
a=1 


Proor. This follows immediately from Theorem II since [adjP]’ = C,_,[P}. 


(It should be noted that > da is the degree of the determinant of P.) 


a=1 


Coroutiary III.1. If a matric polynomial P has a left zero R then all of its 
column degree invariants are greater than zero. 

Proor. If Risa left zero of P then it is a left zero of a regular column normal 
matric polynomial N. Hence N = [A — R]Q which is clearly impossible if all 
elements of some column of N are constants. 
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CLASS FIELDS OF INFINITE DEGREE OVER p-ADIC NUMBER FIELDS 
By Orro F. G. ScHILLine 
(Received May 20, 1936) 


In this paper we shall develop a theory of the infinite abelian algebraic exten- 
sions of p-adic number fields. We try to characterize these fields by certain 
groups defined in the ground field. In the case of finite abelian extensions these 
groups are the norm class groups’ and our goal is to find a substitute for these. 
The natural approach to this is to consider the set of all finite subfields contained 
in the given field of infinite degree. All these subfields define norm class groups 
in the ground field. With the help of these groups we are able to define a system 
of neighbourhoods in the multiplicative group of all elements (not equal to zero) 
belonging to the ground field. The closure of this topologized group is the 
generalization of the finite class groups. It is isomorphic to the Galois group 
of the infinite abelian extension. To establish this fact we use the theorems of 
the known finite local class field theory. We may mention that the problem is 
slightly simplified by the remark that the infinite extensions which are con- 
sidered here are allenumerable. Therefore we need only an enumerable system 
of neighbourhoods to describe the topology in the space mentioned above. 

Before going into the details of the construction we first state some well 
known facts about the local class field theory and the Galois theory of enumerable 
infinite extensions.” 


1. Let k be a finite p-adic number field of the most general kind, that is to 
say, a field which is perfect with respect to the discrete valuation B of rank one 
and the field of residue classes belonging to the prime ideal p of k is a finite 
Galois field.’ For example, all the possible p-adic extensions of finite algebraic 
number fields are such fields. 

Let us denote by k* the multiplicative group of all elements in the field k 
that are not equal to zero. Then the factor group k*/k*” is a finite abelian 
group for every positive integer n.* 





‘Cl. Chevalley, La théorie du symbole des restes normiques, Crelle vol. 169 (1933). 

? For details on infinite algebraic extensions cf. M. Moriya, Theorie der algebraischen 
Zahlkérper unendlichen Grades (MI), Journ. of the fac. of sci. Sapporo ser. I. vol. ITI (1935). 
Galoissche Theorie der algebraischen Zahlkérper unendlichen Grades (MII), same journal 
ser. I. vol. IV (1936). 

_ ‘Such fields are, in addition to the p-adic extensions of number fields, the perfect exten- 
_ abstract fields of functions. Cf. H. Hasse in different papers in Crelle vol. 173 

‘For number fields e.g. K. Hensel, Allgemeine Theorie der Kongruenzklassengruppen 

und ihrer Invarianten in algebraischen Zahlkérpern, Crelle vol. 147 (1917). 
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In the local class field theory all abelian extensions K of degree n over k are 
described in a one to one way by the different subgroups of index n of the group 
k*/k*". 

If we denote by K* the multiplicative group of the field K, by Nx,» the norm 
of K relative to k, and by Gx, the Galois group of K with respect to k, then we 
have the isomorphism 


Crk => k*/Nxj.K*. 


This is the theorem of isomorphism. 

The actual isomorphism is furnished by the norm residue symbol. For 
every number a ~ 0 of k there is defined the symbol (a, K)° which is an element 
of the galois group Gx;,. This symbol has the following properties 


i) (a, K)(b, K) = (ab, K), 
ii) (a,K)=1  ifandonlyif a=Nx,A  withAinK, 
iii) (a, K) — (a, K’) if the substitution is applied on a subfield 
K’ of K which contains k, 
iv) (a, K) runs through all elements of Gx; if a runs through all elements 
of k*. 


By property ii) of the symbol the group of all elements a of k such that 
(a, K) = 1, coincides with the norm group Nx;,K*. 

The properties i) and iv) show that the mapping of k*/Nx,,K* in the group 
Gx, is anisomorphism. This theorem is called the law of reciprocity. 

Let us denote the norm group, Nx;,K*, which belongs to the abelian extension 
K of k by H(K). 

The subfields K’ of K containing k correspond in a one to one way to the 
groups H’ lying between k and H(K) = H, and since 


KCK’ QCKekDH ad, 


we see that H’ = H(K’). (Theorem of ordering and unicity.) 

Furthermore it is shown in the local class field theory that for given sub- 
groups H of finite index in k* there exist abelian extensions K of k such that 
H(K) = H. (Theorem of existence.) 

Now let L be an infinite normal algebraic extension of k, and let G denote the 
group of all automorphisms of L relative tok. If K is an’arbitrary finite normal 
extension of k which is contained in L, let Gj, = G(K) again denote its Galois 
group with respect tok. Every automorphism g of G furnishes an automorph- 
ism g(K) of G(K) if g is applied to the elements of K. All elements g’ of G which 
furnish the same automorphism g(K) as g are then put in one class U(g, X). 
These classes form, for variable fields K in L, systems which have the formal prop- 
erties of neighbourhoods of a topological space. The group G of all automorph- 
isms of L over k is then closed with respect to the topology we have introduced. 





; ie , K/k ies te 
5 We write (a, K) instead of (a4 ) which is the usual notation. 
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The fundamental theorem of the Galois theory then states that the closed 
subgroups S of G correspond in a one to one way to the subfields F of L. 

This most general topologization of the Galois group G can be slightly modi- 
fied. We have already assumed that L is algebraic with respect to the p-adic 
number field k. This means that L can be approximated by countably many 
fnite normal fields K;, such that 


kG---GKuGKC---CL and 
L is the sum of all the fields K;.° 


The fields K; also define neighbourhoods U(g, K;); it turns out that the to- 
pology which is defined by this countable set of neighbourhoods is the same as the 
topology defined by all finite subfields. 

Let G; be the Galois group of the field K; with respect to k, and let g; denote an 
arbitrary element of G;. Instead of U(g, Ki) we may also write U(g;, K;) if 
g furnishes the automorphisms g; = g(K;)in K;. A sequence of neighbourhoods 
U(g:, Ki) such that 


U(gix1, Kiss) © UYi, Ki) 


in the sense of the theory of sets as applied to the space G will be called a funda- 
mental sequence. It is obvious that the elements g of G correspond uniquely 
to the fundamental sequences of neighbourhoods. 

This fact can be expressed in a more formal way as follows. For every index 1, 
the galois group Gi: is a homomorphic map of the Galois group G;. This 
homomorphism may be called ¥;. With respect to these homomorphisms we 
define fundamental sequences (91, g2, --- , gi, --: ) of elements g; of the groups 
G;. We postulate that g;.1 = giv; for every index 7. 

The product of two sequences (91, g2, «++, gi, +--+) and (91,92, --+,9i,-:-) 
will be defined as the sequence (gigi , 9292, --+ 9igi, +--+): 

Again these sequences form a topological group and one easily shows that it is 
isomorphic to the group G. 

An immediate consequence of the finiteness of the groups G; is that the group 
Gis a compact group." 

The last definition of G has the advantage that it can be better used for the 
more detailed study of the infinite fields.® 

Finally, we observe that the formal product over all the relative degrees 
(Ki. Ki) = nj isa G-number N = lim;.,,] [}-17; in the sense of Steinitz.’ It 
depends on the special approximating sequence K; of the field L. Later on we 


shall use this G-number for a special construction. 
a a el oe Tl 

* By the sum or compositum of fields K; we understand the smallest field K in the alge- 
braically closed extension Q over k which contains all the fields Ki. 

"Cf. W. Krull, Galoissche Theorie der unendlichen algebraischen Erweiterungen, Math. 
Annalen vol. 100 (1928). 

*Cf. MIT. 


*E. Steinitz, Algebraische Theorie der Kérper, Crelle vol. 137 (1910). 
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2. Let L be an infinite abelian algebraic extension of the p-adic field k and 
{K;} an approximating sequence of finite subfields containing k. Then the 
norm groups H(K;) = Nx; nKi for 7 = 1, 2,--- form a properly descending 
sequence of groups in k: 


k* > H(Ki) 2 H(K:) D--- DA(Ki) D---. 


The intersection of all these groups will be called N. The group N is uniquely 
determined by the approximating sequence {K;}. 

We now form the class group k*/N. Its elements will be denoted by (a). 
This group has the property that it can be mapped on the class groups k*/H(K;) 
for i = 1, 2,---. The subgroup which corresponds to the homomorphism 
k*/N — k*/H(K;) is H(Ki)/N, because (k*/N)/(H(K,)/N) = k*/H(K)). 
Furthermore, k*/H(K;) is mapped homomorphically on k*/H(K,-:): this 
homomorphism will be denoted by ¢; . 

These two homomorphisms suggest a topologization of the group k*/N with 
respect to the class groups k*/H(K;) as approximations. 

In order to obtain this topology in k*/N we proceed as follows. First we get a 
representation of k*/N. To do this let (a) be an arbitrary element of k*/N. 
Then (a) is congruent to a well defined residue class a; of k*/N modulo H(K;,)/N. 
Let us put together all these residue classes (which can be represented as classes 
of k*/H(K;) as a consequence of the isomorphism (k*/N)/(H(K;)/N) = 


k*/H(K;) in the TOW (a1, ae, see » Ss, see ). 

The notion of equality in k*/N carries over to the representation of the 
elements by these rows, that is (a2) = (b) if and only if (@,, de, ---,di,---)= 
(b:, be, ---,6:,---). This is an immediate consequence of the fact that the 
elements (a) are classes of k* modulo N. 

The next step is to introduce fundamental sequences (@ , 2, --- ,€i,--:) =C 


of elements é; in k*/H(K;). A sequence c is called a fundamental sequence 
if and only if ¢1 = é:g;. The definitions of equality and of product of two 
fundamental sequences are again obvious. 

The group k = k(L)”° of all fundamental sequences will be called the adjoint 
group of the infinite abelian algebraic extension L in the ground field k. As in 
the Galois theory of infinite algebraic extensions it can be shown that k is a 
compact group.” The group k contains the class group k*/N. 

To this construction there corresponds the introduction of a system of neigh- 
bourhoods in k*/N. The neighbourhood U;,z; is composed of all elements (a) 
of k*/N for which (a) is congruent to a; of k*/H(K;),j = 1, 2,---,%. This 
means that the neighbourhood U;,a; is made up of all the elements (a) which 
lie in the same residue class (a)-H(K;)/N. These systems of elements form a 





10 We shall see that this notation k = k(L) is justified. The closure of k*/N does not 
depend on the special approximation {K;} of L. 
11 Cf. no. 7. 
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denumerable set of topological neighbourhoods. It is easy to verify that all 
the postulates for topological neighbourhoods are fulfilled.” 
A sequence Uj,a, for h = 1, 2, --- is called a fundamental sequence of neigh- 


bourhoods, if 
Rit .. Uh .an . 


The fundamental sequences of neighbourhoods are in a one to one corre- 
spondence with the fundamental sequences c. The group k is the closure of 
k*/N with respect to this topology. 

Remark: The system of neighbourhoods we just constructed could be 
replaced by a slightly more general system. Let K be an arbitrary finite sub- 
field of the given field L and H(K) its corresponding norm groupink. Further- 
more let N’ be the intersection of all H(K). We then consider the class group 
k*/N’. Now all classes (a)’ which lie in the same residue class of k*/N’ modulo 
H(K)/N’ constitute the neighbourhood Ux,a,. These sets of elements also 
fulfill all postulates of topological neighbourhoods.” The compact space k’ 
with respect to this topology of k*/N’ is isomorphic to the space k which was 
defined with respect to a defining sequence {K;} of L. The reason for this 
fact is that each field K contained in L is a subfield of a suitably chosen field K; 
of the special approximating sequence K;. Furthermore N = N’. The 
denumerable system of neighbourhoods U;,<; is then equivalent to the system of 
all Ux,ag. Therefore the spaces k and k’ can be considered as identical. In 
the following investigation we prefer the more explicit representation of k 
as the group of all fundamental sequences c. 


TueorEeM I. The adjoint group k of an infinite abelian extension L is iso- 
morphic to the Galois group G of L. 

Proor. We shall prove the isomorphism by writing down explicitly a 
mapping between k and G. For this purpose we apply the theory of the norm 
residue symbol (a, K) which establishes in the finite case the law of reciprocity. 
Let us abbreviate the symbol (a, K;) to x:(a). Property ii) of the finite norm 
residue symbol asserts that x(a) depends only on the class of a modulo the 
norm group H(K;). Therefore we can write x;:(a;) = x:(d:), where d; denotes 
the class, modulo H(K,), which is uniquely determined by the element a; . 

Now let a = (G1, de, ---,d;, +--+) be an arbitrary element of the adjoint 
group k. We now define a symbol x(a) for all elements a of k by 


x(a) = (xi(di), xe(de), --- , xi(Gi), --- ). 


This symbol is multiplicative, i.e. x(a)-x(b) = x(ab) for any two elements a 
and b of the group k. 
eects 

"UNV is a neighbourhood because H(K) 1 H(K’) is the norm group of KK'CL. 
Separability: If (a) (b) then there exist U((a)) and U((b)) such that U((a))N U((b)) = 0, 
because (a)(b)-! = N/N! 

* Cf. again no. 7. 
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First we observe that x(a) is an element of the Galois group G, for x;(a;) = 9 
is an element of the Galois group G; belonging to the field K; of the approximating 
sequence. Therefore there remains to be shown only that 


Xi-1(Gi-n) = xi(Ga)yx for every 1, 


and this is a consequence of the property iii) of the norm residue symbol. We 
see now that 


xi(@;) = xi(ai) > xin(as) = xiu(GiiNx;%.Ai-) 
= xi-1(d;-1). 


Here A;_; denotes an element of the field K;; such that the element a; which is 
an arbitrary representative of the class d@;, becomes equal to aj41Nx,_,,Ain 
where a;_; is a representative of the class d;1. The assumption that a is a 
fundamental sequence guarantees the existence of such an element A;-_;. 

The symbol x(a) furnishes an isomorphic mapping of k on G. 

We must show that every element g = (91, ge, --- , gi, -- -) of the group G can 
be obtained. Again we can refer to the fundamental properties of the finite 
norm residue symbol. By property iv) there always exist elements a; of the 
field k such that x;(a;) = gi for given g; and by virtue of property ii) the elements 
a; are determined up to norms Nx,;/;,A; of elements A; in K;. 

We now form the system (di, dz, ---,G:, --- ) where the a; are the uniquely 
determined residue classes of the elements (a;) modulo H(K;,)/N. Then that 
infinite row is an element a of the adjoint group k. We must show that 


Gigi = Gi-1. 
But gi = xi(di) > Gia = xia(Gin) = gi: 
by construction 
and gi = xi(Gi) > xi-1(ai) 


by property iii) of the symbol. Hence x;-:(d:1) = x:-1((a:)) or (ai) is con- 
gruent to d;_, modulo H(K;_1)/N, and this is equivalent to d;1 = digi. 

We are now able to state some corollaries concerning the relations of the 
subfields F of L and the closed subgroups h of k. 

The existence of the isomorphism x between the adjoint group k and the 
galois group G allows us to translate the fundamental theorem of Galois theory, 
namely that there is a one to one correspondence between the subfields F of L 
and the closed subgroups F of G, into a theorem about subgroups of k. 


Corotiary 1. The closed subgroups h of k correspond uniquely to the subfields 
F of Land the Galois group of F over k is isomorphic to the factor group k/h. 

W. Krull has shown that a closed subgroup h of k is uniquely determined by its 
components h; in k*/H(K,). In this case the components h; are determined 





14 Cf. no. 7. 
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by the norm groups H(K; M F) corresponding to the intersections of the asso- 
ciated field F and the fields K; of an approximating sequence of L by the relations 
1, = H(K; 9 F)/H(K)). 

The intersection and compositum of any two closed subgroups h, and he 
are again closed subgroups of k. The first is obvious, the latter needs a more 
detailed consideration which may be found in the above mentioned paper of 


M. Moriya.” 


Corotnary 2. If F; and Fs are two subfields of L and hy and hg are their 
corresponding subgroups in k, then 


FiFooh Nb, and F,N Fob. 


If we introduce the group K of all continuous characters of k and denote by H 
the group of characters which map the subgroup h of k into unity, then we can 
state both corollaries in a more direct form :" 


CorotiaRy 1’. The subgroups H of the group of all characters of k correspond 
uniquely to the subfields F of L and H 1s the group of all characters belonging to the 
associated group of F. 

Corotuary 2’. If F, and F, are two subfields of L and H, and Hy are the char- 
acter groups of their associated groups, then 


FF, H,He and Fi, N Fe H; N He. 


The set of all finite subfields K of an infinite abelian field L over k has the 
property that for any two subfields K and K’ their compositum KK’ and their 
intersection K 1 K’ belongs to it. The equivalent fact about the set of all 
norm groups H(K) is that H(K) NM H(K’) and H(K)H(K’) belong to it. 

Now let S’ be a set {H(K)} of subgroups of k which has the property just 
stated. Let n denote the finite index k*/H(K). The least common multiple 
of all the integers n is a certain G-number N. Let n; be an arbitrary approxima- 
tion of N, so that lim n; = N. 

The factor groups k*/k*"* are all finite. We form the intersection of all 
groups H(K) that lie between k* and k*”* and call it H;. The groups H; 
form a subset S of S’. The intersections N’ and N of all groups of S’ and S 
are then equal by construction. The class group k*/N can be topologized with 
respect to the approximations k*/H(K) and k*/H; and again both topologies are 
seen tobe equivalent. Let k be the closure of k*/N with respect to the system S. 


Turorem II. For any closed group k belonging to a defining system © of 
subgroups in k there exists an infinite abelian field L over k whose adjoint group 
K(L) is equal to the given group k. 


SD 
* Cf. MII. 


ms L. Pontrjagin, The general topological theorem of duality for closed sets, also, The theory 
0! topological commutative groups, Ann. of math. II, ser. 35 (1934). 
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Proor. The existence theorem of finite abelian extensions over the field | 
asserts that there exist abelian fields K; over k whose Galois groups are jgo- 
morphic to k*/H;. They are uniquely determined. Since the groups H, 
form a descending sequence, --- D Hi: DH; D--- , the fields K; form an 
ascending sequence of fields, --- C Kia GK; ---. Thesequence {K;} is the 
approximating sequence of an infinite abelian extension L of k. The field L is 
uniquely determined. It is obvious that the adjoint group k(ZL) is equal to k. 
(Because there we have H; = H(K;).) 

Each abelian extension M of k whose degree m is a divisor of the G-number N 
may be called a field of exponent N.” Again let lim n; = N. Then we put 
H,;=k*"'. To this special set of subgroups belongs, as we have seen, a uniquely 
determined abelian field F. Then the following corollary holds and its proof 
is quite obvious from all the proceeding observations. 


Corotuary. Every abelian field M of exponent N is contained in the field F 
and the fields M correspond uniquely to the subgroups of the character group of k(F). 

ReMark. The unramified infinite extensions W of k which are all fields of 
roots of units correspond to the different topologies that can be introduced in 
the infinite cyclic group.” This follows from the fact that all units of the 
field k are norms in the case of unramified finite extensions.” Therefore N is 
equal to the group of all units and k*/N is isomorphic to the infinite cycle {p} 
generated by the prime element p of k. 

For instance if we take as neighbourhoods of unity in {p} the groups U; = 
{p" } fori = 1, 2, --- we obtain the additive group of all integer g-adic numbers 
as Galois group of the field of all (P*’ — 1) — st roots of unity. Here the number 
P denotes the number of residue classes of the field k, modulo its prime ideal 
(p), and q an arbitrary prime. Or, if we take U; = {p”*"*’'\"*} where n; denotes 
the 7 integer as neighbourhoods, we have the field W of all roots of unity; its 
group is isomorphic to a Cantor group or monothetic group. 





Tue INSTITUTE FOR ADVANCED StTupy. 








17 We say m divides N if m can be approximated by divisors of n; . 

18 For the theory of the unramified and cyclic finite extensions cf. H. Hasse, Uber p- 
adische Schiefkérper und ihre Bedeutung fiir die Arithmetik hyperkomplexer Zahlsysteme, 
Math. Annalen vol. 104 (1930). 

19 Cf. no. 18. 
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NOTE ON MATRIC ALGEBRAS 
By HERMANN WEYL 


(Received December 3, 1936) 


1. Preliminaries. In §1 and Appendix 7 of my paper, Generalized Riemann 
Matrices and Factor Sets, I established the theory of matric algebras by operat- 
ing with the matrices themselves and their vector space rather than with 
abstract elements, and getting along without the discussion of the radical and 
its influence upon the structure of the algebra. One can treat the X-multipli- 
cation in the same style and thus complete the theory, as I propose to show 
briefly in this note, in which I make use of the same notations and nomen- 
clature as in the cited chapter. At the bottom we have a (commutative) 
field k; the words “in k’’ should tacitly be supplied to all terms like “matrix,” 
“algebra,” “irreducible.” 

The set of all d-rowed matrices (transformations in a d-dimensional vector 
space) is denoted by Ma (complete matric algebra). EH = Ez is the unit matrix. 
The linear closure of a given matric set 2% = {A} consisting of all possible 
linear combinations of the elements A of %& with coefficients in k will be indi- 
cated by 


[2] or [A]a ing. 


The set of matrices 


|A 0--- O|| An --+ Aw 
PSL Of rows) and fo S| 
0 0-.-A Aw +++ Aw 


where A or the Aj, vary independently in % is called u- WU and Y%, respectively; 
they are algebras if % is such. By transition to a new codrdinate system a 
set { of linear mappings A passes into what is called an equivalent set (~ 2). 
When 6 variables 2, undergo a linear substitution A and d variables y,, undergo 
a linear substitution B, then the éd products x,y, undergo the substitution 
AX B. If the 2, and y, are looked upon as components of vectors z, y in a 
- and d-dimensional vector space rx and y respectively, then zu. = 2.y, are the 
tomponents of a vector z = zy in the éd-dimensional product space ry. If A 
tanges over a set 2 and B over %, we mean by A X B the set of all matrices 


AX B, Ain Y, Bin &. 
ee 


‘Annals of Math. 37 (1936), pp. 709-745. 
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This is not an algebra even if 2% and Bare such. The linear closure 
[A x Bla in&,BinS 


is to be denoted by [2% X %B] and will be called the algebra product of the algebras 
% and B. We have 


A, = (M,. XK A], 
hence 


[A X B] = [A XK Bho. 


Lemma (1-A). Let R = {C} be an algebra of transformations in a d-dimen- 
sional vector space r, containing Ez. The product pv of vt with a vector space p 
of dimensionality 5 may be considered as the substratum of the transformations of 
Mm; x R. Then each of its invariant subspaces is of form pt’ where r’ is an 
invariant subspace of t. In particular, irreducibility of R entails the same for R;. 

Of this lemma I made use in the proof of the criterion, Theorem (1.3-D), l.c. 
Its demonstration is fairly obvious. Let «, ---,«; be a basis of p. Each 
vector z in pt may be decomposed according to 


Z2= 2 +--+ + e525 (z, vector in tr). 


E,, being the 6-rowed matrix which has a 1 at the crossing point of the ¢” row 
and the «‘* column and 0 elsewhere, application of the operation E,, X E shows 
that our invariant subspace f of pr contains the parts ¢,z, of each z int. The 
operations like Ey, X E prove that with «2; (2: in r) also ez; lies int. Conse- 
quently ¢ is of the form pr’, and the operators E X C force the subspace r’ of r 
to be invariant. 

An abstract division algebra p = {y} of order 6 is irreducibly represented by 
associating with y the substitution 


y*: t= yt (é varying in p) 
(regular representation). p* = {y*}. The substitution 
f= by 


may be denoted by y« and the set of all ys by px. A division algebra is normal 
provided the centrum consists of the multiples of the unit only. In the Appen- 
dix I proved the following statement concerning normal division algebras by 
way of a simple application of Burnside’s criterion: 

Lemma (1-B). Let p be a normal division algebra. The & substitutions 


f = agp 


one obtains by letting a and B run independently over a basis & , «++ , & Of P yield 
a basis for the complete matric algebra Ms . 
One might put this down in the following formula 


[a* Bxla,@ inp = Ms . 








ne 
to 





zebras 


imen- 
pace p 
ons of 
is an 
rw R;. 
), Le. 
Each 


in t). 


h row 
shows 

The 
onse- 
‘oft 


od by 


in p) 


ormal 
ypen- 
is by 


yield 





NOTE ON MATRIC ALGEBRAS 479 


9. The basic argument. We are now going to consider the product pr of a 
normal division algebra p of order 6 and a d-dimensional vector space rt subject 
to the transformations C of a given irreducible matric algebra R = {C}. The 
space pt is considered the substratum of the transformation set 


p* x R = {y* x Chi ins.ciak- 


We then maintain that pr splits into a number u of irreducible invariant sub- 
spaces in each of which y* X C induces the same transformation; or that we 


have an equivalence 
(21) eX Kk~u-§, irreducible. 


This statement and its proof are the backbone of our whole discussion; the 
rest is mere juggling around and interpretation of the result. We proceed as 


follows. 
The vectors x of pt are expressed in terms of a basis ¢ , --- , eg of r as: 
(2.2) 2= he t--> + baea (&; in p). 


Considering p as a “quasi-field” in which the coefficients £; vary freely, we 
define 


yx = (vile: + +> + (véadea, ay = (fryer + ++ + (Eav)ea. 


An invariant subspace [ of pr is certainly a subset of vectors x of form (2.2), 
closed with respect to addition and front multiplication ( — yx); for the 
latter operation is what we formerly denoted by y* K E. Hence! has a p-basis 
l,,:++,1,in terms of which every z in [ is uniquely expressible as 


t= ml + didi + taln (ns in p), 


and the dimensionality én of { is a multiple of 6. 

8 being a given quantity in p, the space [8 containing all vectors xf (z in I) 
is invariant with respect to y* X C as well as 1, and y* X C induces therein 
the same transformation as in I. By making use of a basis « = 1, --- , 6 of p 
we apply the “typical argument” to the row of irreducible invariant subspaces 


L=le=Lkb = le, +++ [5 = les 


and thus succeed in picking out a number among them which by a proper 
arrangement may be denoted by l1, --- , f, such that 1) lh, --- , l, are linearly 
independent, and 2) each I, (« = 1, --- , 6) is contained in the sum 


h+---+h =. 


The latter fact shows that ({) is also invariant with respect to back multipli- 
tations: ({)¢, is contained in (1) for « = 1, --- , 6. Consequently () is invariant 
with respect to all transformations of the type 


a* Be X C, a and 6 varying over a basis of p, C in R, 
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and thus, according to Lemma (1-B), with respect to Mts X R. Lemma (1-A) 
then proves ({) to be the total space pr, and this remark finishes our demonstra- 
tion, at the same time yielding the equation 


d= nu: 


u is a divisor of d. 


3. Exploitation. Here we restate Theorem (1.3-B) of my former paper as 

Lemma (3-A). An irreducible matric algebra R is ~ ry where r is a division 
algebra (and v a natural number). 

From (2.1) there follow the equations 


[o* X R] ~ u[ 9], [o X R] ~ u[H). 


According to Lemma (1-A), the algebra [], is irreducible as well as [§]; and 
in view of Lemma (3-A) we may put our result into the equivalence 


(3.1) [P X R] ~ u-§, $ irreducible, 


holding for any two irreducible matric algebras P and R the first of which is 
normal. Our result implies the abstract statement: 

THEOREM (3-B). The algebra product of two simple algebras one of which is 
normal, is a simple algebra again. ‘ 

From this we could infer our concrete proposition that [P X R] is a certain 
multiple w of an irreducible $ by means of the general fact mentioned on 
page 714, l.c., that every representation of a simple algebra is a multiple of its 
irreducible representation. Our proof here, however, aimed directly at this 
concrete statement and yielded the further result that uw is a divisor of the 
degree d of R. 

Lemma (3-A) makes transition from division algebras r to simple algebras 2 
so easy that it is perhaps convenient to specialize our result (2.1) to the case 
R = r* rather than to generalize it to (3.1). Hence let us write down the 
(special X special)-equation 


(3.2) p* Xr*~u-H 
This leads back to the (general X general)-result (3.1) in the form 
(3.3) [oy X re] ~ U-[Glow. 


Transition from p* and r* to P = p, and R = r~* leaves the multiplicity u w- 
changed while replacing [$] by the likewise irreducible [G]ow. 

Concerning the (special X special)-case (3.2), I feel bound to make two addi- 
tional remarks. 

First remark. An invariant subspace of pr has a basis I, , --- , ln relative to 
the quasi-field of coefficients in p. However, we may exchange the rdéles of 
p and r and look upon p as a vector space and on r as a quasi-field of multi- 
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plicators or coefficients. { will then have an r-basis \,, ---,A, in terms of 


which 
Yim + gees + Yr, 


describes | with the coefficients y, ranging over r. The dimensionality of | is 
nb = vd, therefore d:6 = niv. 


As d = nu, we obtain the further relation 6 = vu and thus realize that u is a 
common divisor of d and 6. The same relationship prevails in the (general X 
general)-case (3.1). For in passing from p* to P = pe and from r* to R = re, 
the degrees 6 and d change into év and dw respectively, while u stays put, eq. 
(3.3). With this additional information on hand we give the concrete counter- 
part of Theorem (3-B) as follows: 

TuroreM (3-C). The algebra product of two irreducible matric algebras P and R 
one of which is normal, decomposes into a number u of equal irreducible components 
¥ according to the equivalence 


[P X R] ~ u-§. 


The multiplicity u is a common divisor of the degrees of both factors. 

The u equal parts into which the generic matrix ! X C of P X R decomposes 
will occasionally be denoted by II(I, C). In the special case P = p* we simply 
write II(y, C) instead of II(y*, C), and similarly when BR is specialized into r*. 

Second remark. In the (special X special)-relation (3.2) or in 


[p* X r*] ~ u[$] 
we apply Lemma (3-A) to the irreducible [$] and infer from it that 
[9] = ». 


where the abstract division algebra p, called the R. Brauer product, is uniquely 
determined by the factors p and r. Comparison of degrees and orders in the 
ensuing equivalence 

[o* X r*] ~ up, 
leads to the relations 


bd = wb, bd = v’b, 
) being the degree of p* = order of p. Hence v = u and 
d = nu, 6 = wu, d = nv: 


THEOREM (3-D). The algebra product of the regular representations p* and r* 
of two division algebras p and r of orders 6 and d decomposes according to 


[o* X rt] ~ up 
provided p is normal. Putting 
= nu, 5 = vu, (n and » integers) 


the order of the division algebra » equals nv. 
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4. Adjunction. We have not as yet evaluated to the full the idea involved 
in our backbone proof that an invariant subspace I of pr can be referred to a 
p-basis l,, --- ,l,. Let us now consider its implications for the case which 
is the other way around: P X 7r*, P being a normal irreducible matric algebra, 
r an arbitrary division algebra. Let { be an invariant subspace of the vector 
space tr upon which the operators [ X c* of P X r* work (I are operators in 


the 6-dimensional space rt, r is of order d). {has an r-basisl,, --- , 1, such that 
each x of | is uniquely expressible as 
(4.1) x=yh+::-+yl, (y. in r). 


We now look upon tr = tr, as the vector space t under extension of its field 
of multiplicators k into the quasi-field r. The elements of tr, are rows x of 6 
quantities z;,---,2; in r. Addition is defined in the obvious manner, multi- 
plication by a quantity c of r as: 


c(t, , +++ , 2%) = (ct, --+ , CXS). 


A linear subspace I, of r, is a subset closed with respect to addition and multipli- 


cation by any c in r. The subspace [, has a basis 1, , --- , 1, as indicated by 
eq. (4.1). 
Each T is a linear substitution with ordinary numbers y, in k: 
wl = Dy he Vu (1, x = 1, +++ ,8) 


and hence commutes with all the multiplications x — cx. If [, is invariant 
with respect to the transformations [ of P, then each [: x — x’ carries the 
basic vectors 1, , --- , 1, into linear combinations of themselves: 


i= Dical, (.,x =1,---,»). 


Commuting as it does with the multiplications, [ then carries (4.1) into 


Y=eyhte--+yh=yht--- + yb 


Yo = Do clue (ix =1,--+,»). 


Here it is quite essential to write the coefficients c,, after or to the right of the 
variables y,: we therefore speak of a right-transformation. T induces in I, the 
right-transformation || c,. || and the correspondence I —> || cx || constitutes a 
right-representation of P in r. It seems worth while to present our chief result 
in this new garb: 

THEOREM (4-A). Under extension of k into a quasi-field r over k, a given 
normal matric algebra P, irreducible in k, breaks up into u equal irreducible right- 
representations of P in r. 

This mode of visualizing the situation is related to our former viewpoint in 
the following manner. Adopting the codrdinate system here used, I(T, I) 
arises from our right-representation T —> || c, || in replacing each cx by the 
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matrix (Cy)* (back multiplication and hence lower asterisk !) whereas II(E;, c) 


is simply Z, X c*. ‘ ‘ 

Of particular import is the case when r is a commutative field K. We then 
deduce from our theorem that a normal irreducible matric algebra P in k splits 
into w equal irreducible matric algebras in K after extending the reference 
field k to a finite field K over k. Let us consider again the special case P = p*. 


We then must have an equivalence like 
p* ext. to K ~ u-m, 


where 7 is a (normal) division algebra in K. 6 being the degree = order of x*, 
comparison of degrees and orders leads to the relations 


5 = wi, 5 = v8, 


hence 
“=v and 5 = wo. 


Turorem (4-B). A normal division algebra p in k breaks up according to the 
equation 
p* ext. to K ~ unre 


under extension of the field k into a finite field K overk. Hence 
(order of p) = u’-(order of 7). 


An easy consequence thereof is our final 

TuroreM (4-C). The order of a normal division algebra is a square number. 

Indeed, Theorem (4-B) informs us that on successive extensions of the ref- 
erence field the order of p shrinks by throwing off square factors. Therefore 
we merely have to show how to find an extension so as to effect actual reduction 
(u > 1) as long as one has not yet reached the core: p = (I). If p is not of 
order 1 we choose an element a of p different from any multiple of the unit and 
adjoin a root z of the characteristic equation g(z) = 0 of the substitution 
ax:£— fa. The transformation ax — zE# is then singular, ~ 0, and commutes 
with all y*; hence, according to Schur’s Lemma, p* must needs reduce in 
K = k(z). What one actually does is to pick out an irreducible k-factor y(z) 
of y(z) and then define k(z) in abstracto as the field of all k-polymonials of the 
indeterminate z modulo y(z). 

Here we find ourselves at the entrance gate to the “‘splitting fields,” and there 
we might well finish our brief journey over what seems to me a particularly 
smooth and open road through this well-explored territory. 

It was essential to suppose one of our factors to be normal. Again, the 
unruly things happen in the commutative fields: the algebra product of two 
fields breaks up into inequivalent parts, and to secure its full reducibility at all 
assumptions concerning separability are needed. 
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PSEUDO-LINEAR TRANSFORMATIONS’ 
By N. JacosBson 


(Received May 29, 1936) 


It is well known that two matrices 7; and T>2 with codrdinates in a commu- 
tative field are similar, i.e. there exists a non-singular matrix A such that 
T, = A 'T,A if and only if they have the same elementary divisors. In a 
number of problems occurring in complex projective geometry, differential and 
difference equations, and hypercomplex numbers a more general problem of 
similarity is encountered: namely, when are two matrices with co6rdinates in a 
non-commutative field § related in the fashion 


(*) T, = A’T,A* + AAs 


where A = (a;;) is a matrix in §, A* = (a%;), As = (aijs) and the a® and as 
are obtained respectively from the a by a fixed l-automorphism and differen- 
tiation in §? (§1 for definitions). In analogy with the special case of matrices 
in a commutative field under ordinary similarity, we define the elementary 
divisors of a matrix subject to the transformation (*) as certain non-commuta- 
tive polynomials and prove that a necessary and sufficient (n.a.s.c.) that 7; 
and T+ be similar is that their elementary divisors be similar (§8). 

As in the usual theory the present notion of similarity carries with it notions 
of reducibility, decomposability, and complete reducibility (§3). Criteria for 
these may be given in terms of the elementary divisors. In the important 
special case of finite matrices these criteria may be sharpened (§9). 

We consider also the automorphism ring % of 7’, i.e. the ring of all matrices A 
such that AT = TA* + Ag. This is a generalization of the ring of matrices 
commutative with 7 and has many interesting special cases. For example, 
cyclic algebras and the invariant ring of a differential polynomial as defined 
by Ore are included here. The general theory of the ring %{ may be deduced 
from results due to v. d. Waerden, Fitting and others. A more detailed dis- 
cussion is given of some of the special cases which have application to hyper- 
complex numbers and differential equations. 

In the body of the paper we take the abstract point of view, formulating 
the problems in terms of linear transformations or groups with operators. The 
concepts of similarity, reducibility, etc. appear naturally as special cases of 
the concepts of isomorphism, existence of allowable subgroups, etc. of groups 
with operators. This procedure has the added advantage of enabling us to 
apply directly the general theorems on groups to our case. The reader is 





1 An abstract of this paper appeared in the Proc. Nat. Acad. Sci. [7]. A portion of this 
work was done by the author as National Research Fellow at the University of Chicago. 
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referred to v. d. Waerden ({15] vol. I, pp. 132-144 and II, Chap. 15) for a dis- 
cussion of these notions. 


1. Definition of pseudo-linear transformations. Let ®t be a vector space of 
{nite dimensionality n over an arbitrary field § (not necessarily commutative). 
A pseudo-linear transformation (p.l.t.) T of R is defined by the following con- 
ditions: 

(1) (xc + y)T = «T+ yT r,yeR 
(2) (xa)T = (xT )a® + zazs 


8S: * ° 
where the correspondence a — a is a l-automorphism and a — as an S-differen- 
tiation in §, 1.€. 


(3) (a + B)* = a + 6 (ap)* = a'B* 
(4) (a + B)s = as + Bs (a8)s = asB° + as 


anda — ais (1 — 1). Equation (1) states that T is an automorphism of the 
abelian group t and (2) gives the commutation relation 


(2’) aT = Ta® + as 


between T and the automorphism zx — za of ®. We note the following ex- 
amples of p.lL.t. 

(a) a® = a, as = 0. In this case T is a linear transformation (lI.t.) in a 
vector space over any (non-commutative) field. The case § commutative is 
classical. 

(b) § the field of complex numbers, a* the conjugate complex of a, as = 0. 
Plt. of this type were first defined by Segre” and are of interest in complex 
projective geometry. If we suppose that § is any field, a —> a® any 1-auto- 
morphism, as = 0, we obtain as a generalization of the transformations of 
Segre a type of p.l.t. which we shall call semi-linear (s.].t.). Thus the trans- 
formation z + zp = 2M is an s.l.t. with automorphism a > a*® = pau. 

(c) The differential equations 


dr 


where the a(r) are analytic functions of 7, define a p.l.t. in the following way: 
For Rt we take the space of forms u = . u;a;(r) in the independent vectors u; 
with coérdinates @;(r) in the field % of analytic functions of 7. We define T by 


u—-uT =>, uza;(t) + Dd wiax(7) 
where us is given by (5).* T is easily seen to be pseudo-linear with the identity 
as its automorphism. More generally if § is any field a* = a we shall call the 
Corresponding p.l.t. differential (d.t.). 
es vanietetceenteneenidedeemn 


*C. Segre [14] and v. d. Waerden [16] p. 4. 
*Cf. Krull [8] p. 188. 


(5) ur a= dus = Dd uj aji(r) (i = 1,---,n) 
j=1 




















486 N. JACOBSON 
2. The matrices of a p.l.t. Let (e1, --- , en) be a basis for ® and Suppose 
that the p.l.t. T sends e;, into e,T where 
eT = eit + C2T% + °°* + Cntnk (k = 1, +++ \n) 


or, using the usual rule of matrix multiplication 








(6) (eT, ae enT) (e1, ry en) T T= (7;;). 
( &, ) 
If x is the vector ti e.§; = é , then according to (1) and (2) 
Es 
( et | (fs) 
Ss 
(7) ee bel Be 














En / LEns 


Thus the p.l.t. T is completely determined by its matrix in a fixed codrdinate 
system, its automorphism a — a’ and its differentiation a — as. Conversely, 
if T is any matrix and a — a* and a — ag any 1-automorphism and differentia- 
tion, the correspondence x — zT defined by (7) is a p.l.t. We restrict our- 
selves in the present paper to the study of p.1.t.’s with the same automorphism 
and differentiation and so, for the sake of simplicity, we write a® = @, as = a’ 


Let (er, --: , en) be a second basis for 9 related to (e:, --* , én) by 
(er, +++, en) = (€1,°-+ , en)A (ex, -**,€n) = (er, °°", ea)A~ 
where A = (a;;). If T and 7* are respectively the matrices of T relative to 
the bases (e,, --- , én) and (e;, --- , ex) a simple computation shows that 
(8) T*=A'‘TA+A™A’. 


3. Similarity, reducibility, decomposability and complete reducibility. If 
T is a p.l.t. we denote the ring of transformations‘ in 9 generated by T and the 
elements of § by §[T]. From the definition (1) and (2) it follows that the 
elements of §[T] may be represented as polynomials in T with coefficients (on 
the right of the powers of T) in §. We shall consider 9 as a group (XR, §[T)) 
with operators § [T]. 

We say that the p.1.t.’s T; and T, are similar if (9, §[Ti]) is isomorphic as 4 
group with operators to (8, %[Ts]) ie., if there exists a (1 — 1) mapping 
x — 2A of & on itself such that 


(9) (7+ yA =27A+ yA (xO,)A = (rA)Oz 





*The sum T; +'T: of two transformations is defined as 2(T: + Tz) = 2Ti + xT, the 
product as z(T: T2) = (xTi)T:. 
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where O, = Trao + Tr a + -++ + am (k = 1, 2). In order that (9b) hold 
it is sufficient with (9a) to require merely that (va)A = (xA)a and (2T))A = 
(rA)T:. Thus A is a non-singular 1.t. such that 


(10) T,A = AT2, T, = A T,A. 
It follows readily that the matrices T;, T2, A of T:, Tz, A satisfy the equation 
(11) T, =A °T,A+ AA’. 


Conversely, if the matrices of two p.l.t. (with the same automorphism and 
differentiation, are related according to (11), the p.1.t. are similar. 

If 2; is an allowable subgroup, or invariant subspace of ‘SR, i.e. a subspace 
which is transformed into itself by T, then T induces a p.l.t. T; in 8; which 
may then also be considered as a group with operators (9t:, §{Ti]). We call 
T, a contraction and denote its relation to T by T = Ti, or T; < T. In this 
case T also induces a p.l.t. in the projection space (factor group) R/9, which 
we denote as T/T,. If 9, and ®2 are two invariant subspaces, then so is their 
vector sum 9%; U 2 and their intersection #; NM M.. If T; and Ty. are the 
contractions of T in 9; and Re , we denote the p.1.t.’s corresponding to 2%, U ®R. 
and 9, M R. by T; U T, and T, /M Tz respectively. The set of contractions 
of T considered relative to the operations U and f/ is an instance of an abstract 
system of the type called a modular lattice by G. Birkhoff and a Dedekind 
structure by O. Ore.’ Its characteristic property is the Dedekind law: 

If T; 2 Ts, then T, N (T2 U Ts) = (Ti nN To) U Ts. 

IfT > T, (T 2 T, but T ¥ T;) where T; ¥ 0, then T is reducible. If a basis 
of 1 is supplemented to give a basis for Rt, it follows that the matrix of T 
relative to this basis has the form 


T. 
as) ( 1 5 
O T, 
where 7; is a matrix of T, and T2 of T/T, 7 The sequence of contractions 


T=Ti>T2>--->Ti> Tim =0 


is a composition series for T if each T;/Tis: is irreducible. 1 is the length of the 
series and T;/T;,; (¢ = 1, --- , I) the composition factors. The Jordan-Hélder 
theorem’ asserts that any two composition series for T have the same length 
and that their composition factors are similar in pairs. T is decomposable, 
T= 1, © T:, if T = T, U T, where T,; M T. = 0 and T; # 0. Ifa basis 
for 9 is chosen to consist of a basis for 9; plus a basis for I, then T has a 
matrix of the form (12) with Q = 0. Conversely, if T has a matrix of the 
form (12) ((12) with Q = 0), then T is reducible (decomposable). The decom- 
position of T may be continued to obtain T = T; ® Tz; @ --: © T,, where 
a aeceieaieseeeeessaeeenteninsiainninnas 


°G. Birkhoff [2] p. 445 and Ore [13] p. 411. 
v. d. Waerden [15] I, p. 140. 
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each T; is indecomposable. In this connection we have the theorem of Krull’ 
that any two decompositions of T into indecomposable parts have the same 
number of parts which may be paired off into similar pairs. If the indecom- 
posable parts of T are irreducible, then T is said to be completely reducible. 
It follows that T has a matrix of the form 


Ti 
T? 








Ty, 


\ 


where the 7’; are irreducible and conversely. 


4. Non-commutative polynomials. Let §[t, ", ’] = §[t] be the ring of non- 
commutative polynomials in the indeterminate ¢ with coefficients taken on the 
right in § and in which multiplication is defined by 
(13) at = ta + a’ 
$[t] is a domain of integrity with Euclidean algorithm. The ring §[T] asso- 
ciated with the p.1.t. is in a sense a representation of §[¢] and so it is natural to 
expect that the properties of the latter.will be useful in studying T. In this 
section we shall give a resumé of those properties of §[t] which will be required 
below referring the reader to a paper of Ore’s for a complete discussion.’ 

From the Euclidean algorithm follows the existence of a h.c.l.f. (herf. 


(a(t), B(t))z ((a(é), B(é)e) and a l.erm. (le.1.m.) [a(é), Ble (la), BOId) of 
any two polynomials a(t), B(é) in §[t]. We suppose these to be normalized, 
i.e. to have leading coefficient = 1. It follows that they are uniqve. The 
left-transform of a(t) by B(t) is defined as 


a(t) = B(t)*-a(t) -B() = p(t) [a(t), B(t) |e BS’ ao 


where r is the degree ca(t), d = (a(t), B(t))z, ao and By the leading coefficients 
of a(t) and A(t) respectively. Since 


oa(t) + oB(t) = ola(t), Ble + o(a(t), B(t)). 


we have oa(t) = r — d and.so the factor BS” ao gives a(t) the leading coeffi- 
cient a. If d = 0, ca(t) = om(t) and then a(t) and a(é) are said to be left- 
similar. We note the following rules for transforms: 


(14) Bi(t) *-(Bo(t)*-ex(t) -Bo(t)) -Bi(t) = (Bo(t)Br(t))* - x(t) - (Bo(t)Bx(2)) 
(15) B(t)*-[en(t), o2(t)]e-B(t) = [8(t)*-on(t)-B(t), BC) on(t) -BOle 
(16) B(t)*-on(t)aa(t)-B(t) = 0 (B(t)*- ar(t)-B()e- 


7 Krull [8] p. 186. 
5 Ore [12]. 
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Similarly we define right-transforms and right-similarity. Right-similarity and 
left-similarity are coexistential and so we shall speak simply of similarity without 
the modifiers and denote it by the symbol ~. If §[é] is commutative, ie. § 
is commutative and & = a, a’ = 0, then similarity implies identity. 

a(t) is reducible if it has a proper factor. If a(t) = [ai(t), ac(t)le, (a(t), 
a(t). = 1 and oa;(t) > 0, then a(t) is said to be decomposable into the left- 
components y(t), ae(t). In this case galt) = oa (t) + ca(t). If a(t) = 
n(t)a(t) = an(0)6x(2), then a(t) = [Bi(t), Bole, (Bi(t), Belt) = 1 and so 
a(t) is also decomposable into the right-components 6;(t), 62(t) and conversely. 
a(t) is completely reducible if a(t) = [m(t),---, wu(t)]e where the w(t) are 
irreducible. It may be shown that this is equivalent to a(t) = [p:(t), --+ , pu(t)|z 
where the p;(t) are irreducible and that every factor of a completely reducible 
polygomials is also completely reducible.” 

If the right-ideal (u*(#))e (the right multiples of u*(t)) is two sided, i.e. for 
every a(t) there exists a @(t) such that 


(17) a(t)u*(t) = w*OBO 


then we say that u*(t) is finzte. Another formulation of this condition is 
(17') B*(t) = 0 (a(t) *-u*()-a(?))a 


for every a(t). The correspondence A,+: a(t) — B(t) is a homomorphic map- 
ping of §[t] onto a subring G[t] where F — G, t — 4. However, it is easily 
seen that § = G. For the correspondent of a is B = p&p where s = op*(t), 
and p is the leading coefficient of u*(t) and so as a varies over the whole of §, so 
does 8. Since t; is of the first degree in t, §{t:] = §[t] and so A,+ is a 1-automorph- 


ism of §[t]. Thus for every A(t) there corresponds an a(t) satisfying (17), or 


(18) u*(t) = 0 (B(t)-u*(t)-B)) 


for every A(t). 

If 3 is a two-sided ideal in §[¢] and y*(t) an element of ¥ having minimum 
degree and leading coefficient 1, it is easily seen from the division process that 
every element of $ is both a r.m. and a lm. of »*(t). Hence u*(é) is uniquely 
determined by $ and if a(t) is arbitrary a(t)u*(t) = u*(t)B(0), ie. u*(é) is a nor- 
malized finite polynomial. Thus we have a (1 — 1) correspondence between 
the two-sided ideals and the elements u*(t) and the theory of the latter which 
we shall dev velop here is equivalent to that of two-sided ideals. 

J My (), and ps > (t) are finite, then 50 is 3 (t) = wi (t)u2 (t) and A.W =A,A,;. If 

K() = ut (t)ua (t) where y3 * (1) and u(t) are finite, then so is n2(t). Let Be denste 
th totality of normalized finite elements. %* is closed under multiplication. 


—_—. 








*Up to this point we have given an abstract of Ore’s results. The rest of this section 
I believe is new. 
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If u(t) is a Lf. of a u*(t) « B*, say w*(t) = u(t)r(t), then u(t) is bounded. 


Suppose 
6(t) = (u(t), w*O)x = a(t)u) + on(t)u*(), 
then 
5(t) v(t) = u*@)[Bi(t) + Bo(t)O)] 


where f(t), Bo(t) are given by (17). Thus od(t) = ou(t) and y*(t) = »(t)u(t): 


If u(t) is a Lf. (rf.) of a finite polynomial y*(t), then it is also a rf. (lf.) of u*(t). 
Let u*(t) be a polynomial of least degree in B* divisible by the bounded 
polynomial u(t). Equation (16) shows that u*(f) is divisible by 


wit) = [u(t), a(t) *-u(t)-a(t), B(t) u(t) -B(A), «++ de 


where 1, a(t), B(t), --- exhaust all the elements of §[t]. It follows from (14) 
and (15) that 


wilt) = 0 (a(t) -m(t)-a(t))e 


for every a(t) and hence w(t) « B*. Since w*(t) was assumed to be minimal, 
w(t) = yw*(t) and this polynomial is unique. We shall call it the bownd of u(t). 
It may be characterized also as 


[u(t), a(t)-u(t)-a(t)*, B(t)-u(t)-B()™, «++ Le. 


It is easily seen that the product of two bounded polynomials s:(t)us(é) is 
bounded with bound of degree « < oy; (t)us (4) where u; (é) is the bound of u;(t). 
We denote the multiplicative system of normalized bounded polynomials by ¥%. 

Lemma 1. If x(t) ts an irreducible bounded polynomial, then its bound r*(t) 
is irreducible in B*. 

If r*(t) = 1; (t)m2 (t), om: (t) > 0, then x(é) can not be a factor of 7; (t) because 
of the minimality of r*(t). Hence (x(t), 71 (t)) = 1 and x(t) = ri (t) a(t) () 
is a factor of w(t) again contrary to the minimality of 7*(é). 

Lemma 2. %* is a commutative multiplicative system. 

Any polynomial in $* may be factored into a product of polynomials which 
are irreducible in B*. It is therefore sufficient to prove that every pair of 
irreducible polynomials in 8* are commutative. If x; (¢) = 72 (4), this is evident. 
If mi (t) ¥ a(t), then (ri (t), 72(t)) = 1 for otherwise their common factor 
would have two distinct bounds. Hence 


mim) = 12 Oper = [xT O), 72 Ole 


and similarly +2 (t)m}(t) = [xi (t), 2 (le. Thus i (t)e2(t) = 72 (t)mi (0). 
Coroutuary. If ui (t) and us (t) « B*, then A,* A,* = A,*A,*. 
THEOREM 1. The polynomials in B* may be factored uniquely into irreducible 
polynomials in B*. 
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Let 
u*(t) = mi (t)ma(t) «++ w(t) = pr (pz (0) «+ pr(t) 


be two factorizations of u*(t) into factors irreducible in B*. If x(t) is a divisor 
of *(t) irreducible 1 in §[t], then by the argument in Lemma 1, x(é) is a divisor 
of one of the x; (¢) and of one of the p; I(t). Since we can permute the x*(t) 
and the p*(t), we may assume that 47=j=1. By the uniqueness of the bound 
of x(t), we can conclude that m T(t) = pi(t). We cancel off this factor and 
repeat the process to obtain the theorem. 

We conclude this section with an explicit determination of the elements of 
¥* in certain polynomial rings. 

1. §[t] semi-linear, i.e. has a generator ¢t such that at = ta. Let 


(19) =O + tn + oe + te 


«%*. The condition tu*(t) = u*(t)t, gives 4, = t and hence fj = u;; ap*(t) = 
u*(t)q, implies a; = &” and hence &”" "yn; = wa”. Thus if uw; + 0 


~ (m) —1~(m—i) ~ (i) -1 
a = Wi a@ Mi or a” = Bi ay; 


ie. a > &” is an inner automorphism. If q is the smallest integer such that 
a — &” is an inner automorphism, then i = 0 (mod q). q will be called the 


relative order of the automorphism. It is easily seen that u*(t) has the form 


(20) u(t) = t™ + ty, +t ne + ee 
where 
(21) D5 = v; vj;a = av; 


for all a. Conversely these conditions are sufficient to insure that u*(t) « B*. 

TaeorEM 2. Let. §[t] be semi-linear. If the automorphism of §[t] has relative 
infinite order, B* consists of the powers of t only. If the automorphism has relative 
finite order q, then (20) with (21) gives the form of the elements of B*. 

2. §lt] differential, ie. at = ta + a’. If there exists a p such that a’ = 
pa — ap for all a, then we shall call a — a’ an inner differentiation. In this case 
we may replace ¢ by 4; = ¢ + p and obtain at; = tha. §[t] is then of the type 
considered in 1. We therefore assume that is an outer differentiation. 

Let u*(t) of the form (19) be an element of B*. ¢tu*(t) = yu*(t)t implies 
h=tand yj; = 0; av*(t) = u*(t)a, implies a; = @ and since 


(22) = tie mm (3 Yea! + f ital +: . Sere © a” 
we have among other conditions 


ma’ = Mia — ayy. 


If the characteristic of %, x(%) = 0, it follows from the assumption that a — a’ 
is outer that m = 0. 
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TueorEM 3. If §[t] is differential of characteristic 0 and with an outer differ- 
entiation, then B* contains the identity element only. 

If x($) = p # 0, the conditions that »*(t) « B* are somewhat complicated 
but may be satisfied non-trivially. We hope to discuss this case in a later paper. 


5. Matrices with elements in §[t]. Let § [ln denote the ring of matrices 
of n rows and columns with coordinates in §[é]. If U(t) « §[t,] and has an 
inverse U'(t) in §[t],n, then U(é) is a unit in F[f],. If Bid) = U()A(t) V(t) 
where U(t), V(t) e« U, the multiplicative group of units, then B(¢) and A(t) are 
associates. This relation is evidently reflexive, symmetric and transitive. 

We consider the following problem: Given A(é), to select an associate of 
A(t) which has a simple normal form and to determine to what extent this 
normal form is unique. The result’’ given by Wedderburn and by van der 
Waerden in this connection is the following 

Lemma 3. There exists an associate B(t) of A(t) having the form 


(8,(t) \ 
B2(t) 


(23) B(t) = x0 
0 








[ 0 
where B;(t) is both a Lf. and a rf. of B,(t) for j > 7. 
We shall prove an extension of this, namely 
TuHrorEeM 4. The associate B(t) of the diagonal form (23) may be chosen so 
that each B;(t) for i < r is bounded and the bound 8; (t) is a factor of 8,(t) for j > @. 
(1) We begin with the associate B(é) in (23). If every left-transform of ;(!) 
is a Lf. of 6o(t), B:(t) is bounded and its bound 8; (é) divides B2(t) and hence also 
B,(t), 7 > 2. Consider the matrix 


\ 


(B.(t) 


B,-(t) = 











1 In general if % is a ring %, will denote the ring of all matrices of n rows and columns 
with codrdinates in Y. 

11 Wedderburn [17] pp. 139-141 and v. d. Waerden [15] II pp. 120-125. Wedderburn’s 
result is in fact an extension of Lemma 3. However, it too is included in Theorem 4. 
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of Flt]. Using induction on the number of rows we may conclude that 
B,1(t) has an associate 


-ya(t) a 


7r(t) 


C,-1(t) = U,-1(t)Bn—1(t) Vna(@) = 0 








“f 


\ 


where 7; (t) is a factor of y,(t) fori <j, <r. Then 


(y i(t) 7 
2(t) 


1 1 ° 
ee ( — . o/ dass . aa 








\ 


where y:(t) = 6:(t). Since y,(t) for 7 > 2 are linear combinations of 


Be(t), i, oe B-(t), 


it follows that y;(t) is a factor of y;(t) and hence the associate (24) of A(#) has 
the required normal form. 

(2) Suppose next that the transform \(é)~’-8,(t)-A(t) is not a Lf. of Be(é). 
Then 6;(t) is not a Lf. of A(é)Be(t) = E(t). Let 6(t) = (6:(4), E(é))z and A(t) = 
d(t)u(t), E(t) = 4(£)v(t) where (u(t), v(é))c = 1. It follows that m(2), f:(é), m(2), 
{:(t) can be determined so that 


u(t)m(t) + oe) = 1 u(t)na(t) + r(t)to(t) = 0 


and so that 


W.(t) = ar ai) 


fit) f2(t) 


einai 





“The rank r — 1 is invariant: Wedderburn [17] p. 140. 
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is a unit in $[th.* Thus 




















(Bi (t) 
‘1 Ad) : Be(t) (mi(t) mt) 
0 1 ea fit) Salt) 
1 B,(t) 1 
‘ , 
1 oo! 1 
\ 0) 
(a(t) 0 ) 
Bolt)gi(t)  Be(t)s2(t) 
Bs(t) 
. B,(t) 


0 








0 
\ 
is an associate of A(t). Since the degree od(t) < ofi(t), it follows from the 
usual proof of Lemma 3 that there exists an associate C(t) of the form (23) 
with oyi(t) < ofi(t). We repeat the argument with C(é) in place of B(t). 
After a finite number of repetitions of this process we arrive at an associate 
of A(t) of the form (23) in which the first element has a bound which divides 
the succeeding elements. The theorem then follows from (1). 

An associate (23) satisfying the conditions of Theorem 4 is a normal form 
for A(t). The polynomials 6,(t) may be taken to have leading coefficients = 1. 
When this has been done the 6,(t) ¥ 1 are called the invariant factors of the 
normal form. 


6. The invariant factors of a p.l.t. We return to the discussion of the p.|.t. 
T whose matrix 7’ relative to the basis (e,, --- , én) of R is given by (6). 

Let #[t] be the extension of ® obtained by allowing the coefficients to range 
in §[¢]: R[t] is the totality of forms x(t) = >. e;£;(t), &(t) € Flt]. This definition 
is evidently independent of the choice of basis of R. R[é] is an abelian group 
under addition. The correspondence z(t) — zx(é)a(t), a(t) « F[t] is an auto- 
morphism of this group. Thus ¥[t] may be looked upon as an abelian group 
with operators §[é], (R[é], Fé). 





13 Wedderburn [17] p. 134. 
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With z(t) = > e:t:(t) we associate the vector = >> e;¢;(T) and with the auto- 
morphism x(t) > 2x(t)e(t) we associate the automorphism x — za(T). These 
correspondences define an operator homomorphism of (%[¢], §[¢]) into (KR, ¥(T])."* 
Let 9 denote the allowable subgroup sent into 0 by this homomorphism and 


define f@, rEg fnt) by 
(25) (fid), i » fn{t)) = (a1, _— »en)(T = t1) 


where 1 denotes the unit matrix. 
Lemma 4. (fi(t), +--+, fn(t)) ts an independent basis for N. 
The correspondent of f;(¢) in the homomorphism of (R [t], F[#]) and (MR, F[T]) is 


etiy beet Heparin + e733 — T) + esa ti4,7 + +++ + en tay = 0 
because of (6). Thus f,(t) eM. If do f,(t)s,(t) = 0, then 
(26) t6,(t) = Do r48:(t) (j =1,---,n) 


since the e’s are independent. If any 6(t) ¥ 0, let 6,(¢) be one of maximum 
degree. The equation (26) for 7 = s is impossible. Hence all 6(¢) must be 0 
and the f,(t) are independent. Now let x(t) = y be e;&:(t) be any element of 9 [f¢]. 
By means of the relations 


(25') et = fi) + Levris (j= 1,-+-,n) 
we may express x(t) in the form 
x(t) = Do fidvit) + Desk 


where £:¢§. If a(t) eM, we have xz = > e; t£(T) = 0 = >. e;£; since f(T) = 0. 
Thus & = 0 and x(t) = >> f;(t)y.(t) as was to be shown. 
We may replace the bases (€, --- , én) and (fi(t), --- , f(t) of Rlé] and N 
respectively by 
(er (t), fia en(t)) = (41, i en)U ‘(t), 


(O, +--+, SO) = (AO, «+> fO)VO, 
where V(t), U“(t) are units in §[#],. Then (25) becomes 
(ff, +++, fr) = (FO, +++, ex®)T*O 
T*(t) = U@®(T — 11)V@. 
In view of §5 we may choose U (t) and V(t) so that 7*(t) has the form 
6 \ 


where 


1 


ra A a(t) 








ee ee \ pr(t)) 


“This is essentially the method used by Krull [9] pp. 22-32 to obtain the canonical 


form of ordinary linear transformations. 
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where the invariant factors u(t) are bounded with bounds ui () dividing 4,(t) 
if i <r, <j. We shall call the set {us(t)} (k = 1, ---, r) a set of invariant 
factors of the p.l.t. T. 

Referring back to (R, §[T]) which is operator aye to the factor group 
(R (t], 51) /m, we may conclude that the vectors e7, ---,e. corresponding to 
er(t), «++ , ex(t) generate (MR, F[T]) in the sense | each of its vectors may be written 
as ; e; &(T). Since e 1, : ++ ex_,(t) eM, er = 0, ,€x-r = 0 andso may 
be discarded. We denote en—,+x by gz (k = 1, aie If >> 9x5:(T) = 0, we 
have, since 


AO =, +++, freO = en), far) = Cnr Qan(t), «+, AO 
= €5(t)ur(t) 
form a basis for Jt, that 6,(t) is a r.m. of uz(t). It follows that if 
mn) = ED 
then 
(91, T, -**,g:T" 5 +++ 5 Gr, 9rT, +++, grT) 


is an ordinary independent basis for #. Thus > ni =n. Relative to this basis 
T the matrix of T has the form 





T ) 
T2 
(27) ) 
oe 
where 
( \ 
O-- ++ 7% 
1 O 
wo 








The invariant subspace Ri = (gi, gx T, --- ) is generated by the single 
vector gx (acted upon by §[T]). Let T;, denote the contraction of T in ®. 
Thus if g eR. , gT. = gT. We shall call ®;, a cyclic subspace and T; a cyclic 
p.l.t. The polynomial y;(t) is the order of the generator g, of Ri. The above 
result may be stated as 

THEOREM 5. T = T; © T2 © --- @ T, where the T; are cyclic with generators 
gi whose orders y,(t) are the invariant factors of a normal form of the matrix T — tl. 


7. Cyclic p.l.t. In this section we suppose that T acting in 9 is cyclic and 
that g is a generator of 9 whose order is u(é). ou(t) =n. If h = ga(T), its 
order is readily seen to be v(t) = a(t)*-u(t)-a(t). Thus h is a generator of w 
if and only if ov(t) = ou(t), ie. u(t) ~ v(t). Hence 
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Lemma 5. A n.a.s.c. that two cyclic p.l.t. be similar is that the orders of their 
corresponding generators be similar polynomials." 

The vectors of § may be represented uniquely in the form gd(T) where 
\(t) is of degree <n. Now let %: be an invariant subspace of # and T; the 
contraction of Tin #1. Let gi: = gx(T;) be a vector of 9; for which the poly- 
nomial \;(é) has minimum degree for the vectors of ti. It is an immediate 
consequence of the division process that every vector of ®t is a “multiple” 
of g., i.e. T; is cyclic with generator g:. Further, the order y(t) is a r.m. of 
i,(t), u(t) = (mt). The right associate of u:(t) whose leading coefficient 
is 1 is the order of g: . 

Lemma 6. A n.a.s.c. that the cyclic T be irreducible is that the order u(t) be an 
irreducible polynomial. 

The above results establish a correspondence between a factorization of y(t) 
into irreducible factors and a composition series of T. From this correspondence 
it is easily seen that Ore’s theorem on the unique factorization (in the sense 
of similarity) of polynomials in §[é] is precisely the Jordan-Hélder theorem 
applied to cyclic p.l.t. A similar connection exists between the concepts of 
decomposability applied to polynomials and to p.1.t. 

Suppose that the order u(é) is decomposable, say 


a(t) = Ar(E)ui(t) = Ac(E)ue(t) 


(Ar(t), Ae(t))c = 1 = (ui(2), weo(t))e 


and all of these polynomials are normalized. Set gi = gdi(T), ge = gd2(T) and 
let %: and Qe be the cyclic subspaces generated by these vectors. We may 
determine »;(t), v2(t) so that 


Ai (t) 1 (t) 4- Ao(t) v2(t) = 1 


where 


and hence 
g = 9v(T) + gev(T). 


By comparing the dimensions of ®, 91, and 2 we obtain RN: M MR, = O and 
hence T = T; ®© Tz. Conversely, if T = T; © T. and correspondingly 
R= Ii @ Re, let gi = gd(T), go = grdo(T) be generators of I, and M2 such 
that A(t) and As(é) are normalized 1.f.’s of u(é). The orders s(t), uo(t) of gs 
and gz are then normalized r.f.’s of u(é). Since g = gin(T) + geve(T), we have 


(Ar(), A2(t)). = 1, [ui(t), ma(t)|. = a(t). 
Since 2, N RN. = 0, 


ou(t) = om(t) + ope(t) 
and hence 


(ui(é), ue(t))e = 1. 
Thus we have proved 
onesies diate 


* Noether-Schmeidler [10], p. 21. 
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Lemma 7. A n.a.s.c. that a cyclic T be indecomposable is that u(t) be an inde- 
composable polynomial 

Lemma 6 and 7 give 

Lemma 8. A n.a.s.c. that a cyclic T be completely reducible is that u(t) be a 
completely reducible polynomial. 


8. Criteria for similarity, reducibility, etc. Theorem 5 enables us to apply 
the lemmas of the preceding section to arbitrary p.l.t. For according to this 
theorem a p.l.t. T is decomposable unless it is cyclic and the condition that it 
be cyclic is that the normal form of 7’ — #1 have but one invariant factor, 
i.e.r = 1. Thus Lemma 6 and 7 give respectively 

TuroreM 6. A n.a.s.c. that T be irreducible is that it be cyclic and its order 
u(t) be an irreducible polynomial. 

THEOREM 7. A n.a.s.c. that T be indecomposable is that it be cyclic and u(t) 
be an indecomposable polynomial. 

The condition that T be completely reducible is, according to Lemma 8 that 
all of the invariant factors u;(t) be completely reducible. However, if we recall 
that every factor of a completely reducible polynomial is completely reducible, 
we obtain 

THEOREM 8. A n.a.s.c. that T be completely reducible is that the last invariant 
factor u,(t) be a completely reducible polynomial. 

Let 


(29) T=T,.0T2@::-@Ty, 


be a decomposition of T into indecomposable parts. By Theorem 3 each T; 
is cyclic and the order p;(¢) of its generator is indecomposable. _p;(¢) is uniquely 
determined in the sense of similarity by T;. We shall call these polynomials 
the elementary divisors of T. In virtue of the theorem of Krull that the inde- 
composable parts of an abelian group with operators are unique in the sense 
of isomorphism, we have that the T; in (29) are unique in the sense of similarity 
and hence the elementary divisors are determined in the same sense by T. 
Hence 

THEOREM 9. A n.a.s.c. that two p.l.t. be similar is that their elementary divisors 
be similar in pairs. 

The following theorem gives some additional information regarding the in- 
variance of the invariant factors y;(t). 

TuEoreM 10. T, is a cyclic contraction of maximum dimension n, of T. If 
X(t) is the order of any cyclic contraction of dimension n, , then d(t) = x” (url), 
u(t) = uw (t)urs(t) where us_1(t) is the bound of u,(t) and due p(t). 

In the notation of Theorem 5 suppose T = T,; @ Tz @ --- @ T, where Ti 
has generator g; of order u,(t). Let A(t) denote the order of 


g = g.m(T) + --- + g-a,(T). 


The order of gia;(T) is \:(t) = a;(t)’- “mi(t) a(t) and X(t) = Du), , Ar(t) Je - 
Since [Ai(é), --- , Ara(Je is a factor of ura(t), A(t) is a Lf. of ut), d d,(t)]e- 
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Since u(t) = u(t)ura(), Ar() is a Lf. of A? ()ur.(t) where (6) 
a(t): (t)-ar(t). Thus d(£) is a Lf. of A? ()ura(t). Since or? (t) < ou (t) 
we have od(t) S our(t). If equality is to hold we must have oA" (t) = on (2), 
ie. MO) ~ BE). 

Special cases. 

(1). If T is an lt. and § is commutative, §[t] is commutative and similarity 
implies identity. ‘Theorem 9 shows that the elementary divisors are a complete 
set of invariants of T and Theorem 10 may be extended to give the same result 
for the invariant factors. 

(2). If T is an s.l.t. with automorphism of relative infinite order, the finite 
elements of §[é] consist of the powers ¢”. Hence the bounded elements are 
also of this form. Thus in the decomposition T = T; @ --- @ T,_, @ T, of 
Theorem 5 T; ® --- @ T,_1 is nilpotent and T, is cyclic. 

(3). If Tis a d.t. and § has characteristic 0, 1 is the only normalized bounded 
element in §[¢]. Thus r = 1 and T is cyclic. This result gives the well-known 
equivalence between the theory of systems of n linear homogeneous differential 
equations (analytic case) and that of a single n-th order linear homogeneous 
differential equation. 


9. Finite p.l.t. The homomorphism §[¢] ~ §[T] between the polynomial ring 
and the ring of a p.l.t. defines an isomorphism §[T] ~ §[t]/% where & is a 
two-sided ideal. If T ¥ 0, then § ¥ (1). Thus either § = (0) or ¥ = (u*(é)) 
where »*(t) is a normalized finite polynomial ~0, ~1. In the latter case we 
shall call T finite and p*(t) its minimum function. y*(t) may be defined also 
as the normalized polynomial of least degree such that ru*(T) = 0 for all 
tin R. 

Let T be finite and decomposed as T; @ --- @ T, into the cyclic parts T; 
with generators g; whose orders y,(t) are chosen as in Theorem 5. We have 
g-o(T)u*(T) = 0 and hence u*() is a r.m. of a(t) *-u,(t)-a(t). Since a(t) is 
arbitrary, it follows that y,(t) is bounded with bound yu; (t) a factor of u*(t). 
On the other hand it is easily seen that zu;(T) = O for all z. Thus u>(t) = u*(t). 

TaEorEM 11. A n.a.s.c. that T be finite is that u,(t) be bounded. If the condi- 
tion is satisfied, the minimum function of T is u> (2). 

Suppose u*(t) = wy (t)"! we (t) +++ a(t) where 2; (é) are distinct and irre- 
ducible in 8*. Set wi (#)* = pi (t) and y*(t)p: (t) = 7: (t). Let RF denote the 
subspace of % consisting of all vectors of the form x7;(T). Since 7; (t) is finite 
i is an invariant subspace. If 2; eR; then z;p:(T) = 0 and since (p; (2), 


ri(t)) = 1, 
RNR U--- URE U REL U --- RR) = 0. 
Since 
(ri (2), ee rn(t)) = 1, 
R= RFU---U HK 
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and correspondingly 
T=T0T2@:-:: @Tn. 


Thus we have proved 

TuHrorEM 12. If T is finite and indecomposable, ther u*(t) is a power of a 
polynomial irreducible in B*. 

The following is an extension of a well-known result on ordinary linear trans- 
formations. 

TurorEM 13. A n.a.s.c. that a finite T be completely reducible is that its mini- 
mum function have no multiple factors in B*. 

If T is completely reducible, its elementary divisors are irreducible poly- 
nomials and hence their bounds are irreducible in G*. It is readily seen that 
u*(t) is the l.c.m. of these bounds and hence has no multiple factors in $*. 
To prove the converse we note that if w(¢) is an elementary divisor of T, its 
bound z*(t) is a factor of u*(t) and hence is irreducible in 8*. Since 7*(é) can 
be defined as a l.c.m. of irreducible polynomials, it is completely reducible. 
Hence its factor z(t) is also completely reducible, but being indecomposable, 
it must be irreducible. Thus every elementary divisor of T is irreducible and 
so T is completely reducible.” 

Lemma 9. If p(t) is a normalized indecomposable bounded polynomial and 
p*(t) its bound, then the indecomposable parts of p*(t) are similar to p(t). 

Suppose \,(t) is a normalized polynomial decomposable into k parts similar 
to p(t). z(t) is a factor of p*(t). Suppose dA,(t) ¥ p*(t) and consider the p.l.t. 
T defined as the direct sum of (decomposable into) the two cyclic p.|.t.’s whose 
orders are p(t) and \,(¢). Thus T has a matrix of the form 


T= f °) 
0 
where R and L; are associated with p(t) and ,(é) as in (28). We assert that T 
is cyclic. For otherwise the number r of invariant factors u;(t) in Theorem 5 
is >1. By Theorem 10 u,(¢) is the order of a cyclic subspace of maximum 
dimension of T and if ou,(t) = od,(t), dz(t) is divisible by u;-.(f). We obtain 
a decomposition of T into indecomposable parts by decomposing the p;(C). 
Thus y;-:(t) is divisible by a polynomial similar to p(t) and so ),(é) is divisible 
by p*(t) contrary to the assumption that p*(t) ~ ,(#). On the other hand if 
ou,(t) > odx(t) we must have ou,;_s(t) < op(t) and hence T has indecomposable 
parts not similar to p(t) contrary to Krull’s theorem. Thus r > 1 is impossible 
and T is cyclic. Let dz4(t) denote its order. z(t) is decomposable into 
k + 1 parts similar to p(t) and hence divides p*(t). If Azsa(t) ¥ p*(t) we repeat 
the process. If we start with \i(¢) = p(t) we arrive in this way after a finite 





16 The remainder of this section was added Aug. 14, 1936. These results are related 
in part to some recent work by Prof. M. H. Ingraham and Dr. M. C. Wolf on matrices 
with elements in a division algebra. I am indebted to Prof. Ingraham for communicating 
a copy of their manuscript to me. 
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number of steps at p*(t) and obtain a decomposition for it into parts similar 
t). 

“= ammo consequence of Lemma 9 and Krull’s theorem is 

Lemma 10. If pi(t) and p2(t) are indecomposable bounded polynomials with 
the same bound, then p(t) ~ pelt). 

From this and Theorem 9 we have 

TurorEeM 14. A n.a.s.c. that two finite p.l.t. be similar is that the bounds of 
their elementary divisors be identical. 

Let x(t) be any irreducible bounded polynomial with bound z*(¢) and p,(t) = 
m(t) «++ we(t) where the w(t) ~ w(t). Each 7,(t) divides r*(t), say x*(t) = 
ri(t)y(t) = 6:(t)r.(é). Suppose 


r*(t)*" = pea(ta(t) ~ pea(t) = mi(t) «++ mea(t). 
Then 
a*(t)° = pes(t)a(t)r*(t) = pea(t)e*()B(t) = p-(t)y-(t)B(0). 


Hence p. (t) the bound of p.(#) is x*(t)’ where f se. We shall show next that 
m(t), «++, me(t) may be so chosen that p.(t) = x*(t)’. Suppose p-,_i(t) has 
already been found so that peat) = n*(t)°. There exists an irreducible 
1(t) ~ x(t) such that the bound of p.(t) = p.«(t)z-.(t) is r*(t)°. For otherwise 
all the p.(t) divide +*(t)*” and hence their l.c.r.m. p--:(t)x*(t) divides x*(t)*™. 
But then p..(t) divides 2*(t)*” contrary to assumption. We require these 
remarks to prove the following extensions of Theorem 12. 

THEOREM 15. A n.a.s.c. that the bounded p(t) = m(t) --- w(t) (ai(t) irre- 
ducible and ~ to (t)) be indecomposable is that its bound p(t) = x*(t)’. 

Sufficiency. If pe(t) = [0%1’(t), pts (Je where p{}(t) = mi?(t) --+ wi?) and 
r\'(t) ~ w(t) then S(t) divides x*(t) and hence p,(t) divides x*(t)’ where 
f = max (e, @). Hence we can not have both e and e2 < e. 

Necessity. Suppose p.(t) = 2*(t)’ where f < e. We have seen that a poly- 
nomial p(t) = a(t) «+> 1;(t) exists having the bound *(t)’. It follows that 
p;(t) is indecomposable and by Lemma 10 is ~p,(t). Thus e = f. 

TuEorEM 16. A n.a.s.c. that a finite polynomial be indecomposable is that it 
be divisible on the left (right) by only one normalized irreducible polynomial. 

The condition is evidently sufficient. Now if p.(t) is indecomposable and 
divisible by the distinct normalized irreducible polynomials 7(t) and 72(2), 
then p.(t) is divisible by their l.c.r.m. u(t), ie. pe(t) = u(t)ms(t) --- w(t). Since 
u(t) divides r*(t) it follows easily that p(t) divides x*(t)** contrary to 
Theorem 15. 

_ As a consequence of this theorem we note that if p.(t) = mi(t) --- m(t) is 
indecomposable, so is px(t) = mi (t) uF x(t), 1 S k . é. 

If 2(T) is a variable element of §[T], the correspondence x(T) — 2(T)T is a 
pt. T determined by T in the vector space §[T]. The correspondence T — T 
defines an isomorphism between §[T] and %[T] analogous to the regular repre- 
sentation of a hypercomplex system. By means of this isomorphism Theorems 
12 and 13 yield results on the structure of $(T]. Thus it follows that a n.a.s.c. 
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that §[T] be a semi-simple ring is that »*(¢) have no multiple factors in §*. 
This will also be a consequence of a general theorem of the next section. 


10. The automorphism ring of a p.l.t. An (operator) automorphism A of 
(R, &(T]) (or of T) is defined by the conditions 


(30a) (c+ yA = 2A+ yA 
(30b) (cA)O = (xO)A 
for every O in §[T]. It is sufficient to require 
(zA)a = (ra)A (xA)T = (2T)A 


in place of (30b). Thus A is an 1.t. commutative with T and conversely. 
Let A be an arbitrary |.t., T a p.l.t. and z a vector such that (xA)T = (2T)A. 
Then 
(xa)TA = (xTA)& + (rA)a’ (xa)AT = (cAT)&@ + (rA)a’ 
so that (ra)TA = (ra)AT. Thus the totality of vectors for which A and T 
commute constitutes a vector subspace of #. Hence A is an automorphism 
of T is and only if e,TA = e,AT (kK = 1, --- , m) where (e, --- , én) is a basis 
for ®. This condition in terms of the matrices A, T' of A and T is 
(31) AT=TA+A’". 
In a similar fashion we define an automorphism A(t) of (R[t], Flt]) by 
[x(t) + yDJAM = cHA + yYOAO 
[cHa()| A) = [tHAM] a(t). 
If (e:(t), --- , en(t)) is a basis of R[t] then A(é) is determined by the matrix 
A(t) given by 
(a(MAQ), --- , en(AM) = (a0), --- , ent) AM 
and conversely any matrix defines an automorphism. If A(¢) leaves Jt invariant 
where (%, §[T]) ~ (R[é], Flé])/M, then A(é) induces an automorphism in the 
quotient group (%, §[T]), the projection of A(t). Suppose 
(fi), ti » In(t)) a (a(d), pedis en(t)) T(t) 


is a basis of Jt. We recall that T(t) is an associate of T — tl. The condition 
that A(t) leave 3 invariant is 


(32) AQT) = TWA). 
The projection A of A(é) is 0 if and only if 
A(t) =0 (T@)e 


where (7'(¢))z denotes the ideal of right multiples of F(t). 
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Conversely if A is any automorphism of (8, §[T]), it may be extended to 
an automorphism A(t) of (R[#], Flt). For if x) = e&(t) + --+ + enEn(t), 
then we may define A(t) by 


a(t)A(t) = e Agi(t) + --- + en, Aé, (2). 


This is independent of the choice of the basis (e¢, ---,e,). Now if x(t) « 2. 
ie. De: £(T) = 0, then since A commutes with all £;(T), x(t)A() also eR. Thus 
A may be obtained as a projection of an A(¢) leaving 9t invariant. 

Let X(T) = % denote the ring of automorphisms of T. The above dis- 
cussion shows that 9 is inverse isomorphic to the ring of matrices with elements 
in § satisfying (31) and also inverse isomorphic to the ring of matrices with 
elements in §[¢] satisfying (32) taken mod (7 (t))z. The latter will be called 
the invariant ring of T(t).” 

TurorEeM 17. Jf T; ~ Te then A(Ti) ~ A(T»). 

For T; = A T,A and hence if A; e Y(T,), As = AA, A « W(T2) and con- 
versely. 

ToeorEM 18. If T is crreducible, A(T) is a field. 

If Ae X(T), the space RA consisting of all vectors xA is an invariant sub- 
space of R. Hence either RA = 0 or RA = RK, ie. A = O or A has an inverse 
A’. Evidently A™ also ¢ %(T). 

The following theorem, a special case of a theorem on the automorphisms 
of abelian groups due to v. d. Waerden, gives the structure of Y(T) = A fora 
completely reducible p.1.t. 

TuHrorEeM 19. Let 


T=(TY @--- @ Ti?) @.:.@ i> ®---@ = 
where each TS” is irreducible and Ti” ~ TS” but TS” 2¢ Ti? if j # it. Then 
(33) Y= AX @O---@y™, 4° ~ Df? 


where D” is a field, the automorphism ring of TS”. 

For a proof of this theorem the reader is referred to v.d. Waerden [15] vol. II, 
pp. 165-168. 

Consider the case of a finite completely reducible T. Its minimum function 
u*(t) = wi (t) --+ wa(t) where the x; (é) are irreducible in $* and xi (t) ¥ 17 (t) 
(i ¥ j). Any two irreducible factors of x; (é) are similar. Let 2,(é) be such 
a factor. Then z(t) 2¢* w,(t) and (33) gives the structure of & where D” is 
inverse isomorphic to the invariant ring of z,(¢). In particular if m = 1, ie. 
u*(t) is irreducible in *, % is a simple ring ~ a 


ll. Periodic s.l.t. and generalized cyclic algebras. An s.l.t. will be called 
periodic if some power of it, say T” = p ¥ 0, ie. T” is the s.l.t. 2 —> zu = cM 
SS 


‘Cf. Ore [11] p. 241. The connection between the ring of matrices commutative with 


a matrix in a commutative § and the invariant ring was first noted by Frobenius. Cf. 
Wedderburn [18] p. 106. 
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whose automorphism is a > » ap and matrix (in any coordinate system) is yl. 
If a — & and T are the automorphism and matrix of T relative to (e,, --- en), 
T” is the s.l.t. whose automorphism and matrix are 


a>za™ and TT--- 7”. 
Hence N.a.8.C.’S that an s.l.t. be periodic are 
(34) a” = pop TT .-.-T"™ = wl. 
Thus a — &@ has relative finite order, say g and m = gr. (34) implies. 


gi = T Aw pm ad rr ee err aa T* AT” = yl 


and so p*(t) = t” — we B*. T is a finite s.l.t. whose minimum function is a 
factor of u*(t). 

THEOREM 20. If x(§) = p is not a divisor of m/q = 1, T is completely 
reducible. . 

If a? = vay", u = tv commutes with every a. ¢” — » = wp — u where 


pi =p ...9%y, Since uv ¥ 0, the factors of u’p — vin B* are polynomials 
in u. Suppose 
(35) up — w = m(u)'m (u) mi (u) € B*. 


Since u commutes with the coefficients of these polynomials, we may differen- 
tiate (35) formally with respect to u and obtain 


ru'p = [mi (u)mi(u)! + at (u)'at (u)) m2 (u) + at (u)’ a2 (u)’ = 0 (m1 (w)) 


since 7: (u) «B*. If r ¥ 0 (mod 7p), this implies (u”, u’p — uw) # 1 whieh is 
impossible since » ~ 0. Hence ¢” — uw has no multiple factors in $* and T is 
completely reducible. 

If m = q, T will be called normal. In the remainder of this section we 
suppose that T is a normal periodic s.l.t. In this case u*(t) = t* — yp is irre- 
ducible in 8*. Hence yu*(¢t) is the minimum function of T which is then com- 
pletely reducible. If 


u(t) = t — ty fo ae 


is an irreducible factor of u*(t) in §[¢], then 1 | m, m = m’l and a matrix of T 
has the form 


(36) 
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where ; ; 
0: - py 
1 . 
(37) 
Me 
L 1 H1) 


Thus 1| n,n = n’l. We shall call 1 the index of T. 
T is a multiple of the identity matrix if and only if 1 = lie. u(t) =t—- w= 
t-—v. Then 


t? —L= (t on v(t + {t 256? + see + pp” ae ped) 


and hence 
w= N(v) = w---  aiied 


and conversely if 1 = N(v), T has index = 1. 

TurorEM 21. A n.a.s.c. that a normal periodic s.l.t. have index 1 is that 
u = N(v) for some v in &. 

In particular if 1 = 1, we may take v = 1 and then (36) is the identity matrix. 
In the language of matrices this becomes the following extension of Hilbert’s 
theorem on cyclic fields.” 

CoroLLary. Let § be any field and a — & any automorphism whose order and 
relative order are finite and = gq. If T is a matrix with elements in § such that 
TT... T° = 1, then there exists a matrix A in § such that T = AA. 

The automorphism ring & of T with matrix (36) is according to the general 
theory a simple algebra D,’ where D is inverse isomorphic to the invariant 
ring of u(t). If T has the index 1 (u(t) = ¢ — v). It follows from the definition 
of the invariant ring that D is inverse isomorphic to the subfield of § of ele- 
ments 8 such that B = vy 'By. 

If n = q, T is cyclic and Y% is inverse isomorphic to the invariant ring of 
u*(t) = t' — uw. The latter is a generalization of cyclic algebras’ first con- 
sidered by Dickson. The above results are applicable to the study of its 
structure. Thus we have the following extension of a theorem conjectured 
by Dickson and proved by Albert.” 

TuEorEM 22. If q is a prime, the invariant ring U of (t* — u) « B* is a field 
fu * N(v) for any ve§. If uw = N(v), AU ~ D, where D is the subfield of p’s 
such that B = vB. 


12. Differential transformations. In this section we suppose that § is 
commutative and T is a d.t. in ® over §. We denote the subfield of constants 
(elements 8 such that B’ = 0) of § by &. 





'’ Hilbert [5] p. 272. 

“A discussion of eyclie algebras from the point of view of the invariant ring of non- 
commutative polynomials is given in Jacobson [6]. 

* Dickson [3] p. 227 and Albert [1] p. 624. Our extension consists in removing the 
assumption that § has a finite basis over its centrum. 
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We have seen that the automorphism ring Y& of T is given by the matrices 4 
such that 


(38) [A, T] = AT — TA = A’ 


where T is a matrix of T. This matrix equation is equivalent to a system of r? 
linear homogeneous differential equations in the elements of A. Hence the 
number of solutions of (88) which are linearly independent relative to ® is 
finite. We therefore have the following theorem due to Ore” for cyclic d.t. 

THEOREM 23. If T isa d.t. in R over a commutative field §, its automorphism 
ring has a finite basis over the field of constants &. 

Suppose & is perfect and let &(6) be an algebraic extension of & of degree r. 
Consider §(@). Since @ is a separable element, we may extend the derivative 
defined in § to §(@) by defining 6’ = 0. It follows that (0) has degree r over 
§ also; for if 6” + a6” + --- + am = 0 is the minimum equation of 6 over §, 
we obtain by differentiation aj6”~ +--+ + am = 0. Hence each a; = 0or 
a,eR and m = r. 

Let Rs) be the extension of 9 obtained by letting the coefficients vary in 
o(0). If Tisad.t. in &, it has a unique extension T(@) in 2se): The matrix T 
of T relative to (e:, --- , én) is also the matrix of T(@) relative to this basis. 
It is easily seen that the automorphism ring of T(@) is Use, where A ts that of T; 
for an automorphism of T(@) is determined by A(@) such that 


(39) [A(0), T] = A(6)’. 
If 0, --- , 6,is a basis for §(@) over §, we may write A(@) uniquely in the form 
Ai, + Ache + --- + A,O, Aen. 


Since 6; = 0, (39) gives [A;, 7] = Aj, ie. A; determine automorphism of T. 
The converse of this is obvious. 

Now suppose T is an irreducible d.t. which decomposes in every algebraic 
extension §(0) of § into parts all of which are similar. Then the automorphism 
ring %{ of T is a normal division algebra over ®. For if the centrum of 2 has 
degree m over &, it follows from the theory of hypercomplex numbers that 
there exists a field 8(@) such that Ys) is a direct sum of m matrix algebras. 
But then T(@) would have dissimilar components unless m = 1. This criterion 
may be used to construct examples of d.t.’s whose automorphism rings are 
normal division algebras. 


BIBLIOGRAPHY 


1. A. A. ALBERT, On normal simple algebras, Trans. Am. Math. Soc., 34 (1932), pp. 620-626. 

2. G. Birxuorr, On the combination of subalgebras, Proc. Camb. Phil. Soc., 29 (1933), 
pp. 441-464. 

3. L. E. Dickson, New division algebras, Trans. Am. Math. Soc., 28 (1926), pp. 207-234. 

4. H. Firrine, Die Theorie der Automorphismenringe Abelscher Gruppen etc., Math. 
Annalen, 107 (1932), pp. 514-542. 





*1 Ore [11] p. 236. 











es A 


of n° 
the 


hism 


ee f. 
ative 
over 
er ff, 


0 or 


‘y in 
ix T 
asis, 
of T; 


form 


Fn: 
of T. 


ric 
hism 
has 
that 
ras. 
rion 

are 





PSEUDO-LINEAR TRANSFORMATIONS 507 


5. D. Husert, Die Theorie der algebraische Zahlkorper, Jahr. Deutsch. Math. Verein., 
* (1897). 

6. N. ps Non-commutative polynomials and cyclic algebras, Annals of Math. 35 
(1934), pp. 197-209. 

7. N. Jacopson, On pseudo-linear transformations, Proc. Nat. Acad. Sci., 21 (1935), pp. 
667-670. 

8. W. Krutu, Uber verallgemeinerte endliche Abelsche Gruppen, Math. Zeitsch., 23 (1925), 

. 160-196. 

9. i ca Theorie und Anwendung der verallgemeinerten Abelschen Gruppen, Sitz. 
Heidelberg Akad., 17 (1926), pp. 3-32. 

10. E. NorrHer AND W. ScHMEIDLER, Moduln in nichtkommutative Bereichen etc., Math. 
Zeitsch., 8 (1920), pp. 1-35. 

11. O. Ore, Formale theorie lineare differentialgleichungen II, Jour. fiir Math., 168 (1932), 
pp. 233-252. 

12. 0. Org, Theory of non-commutative polynomials, Annals of Math., 34 (1933), pp. 480-508. 

13. O. OnE, On the foundations of abstract algebra I, Annals of Math., 36 (1935), pp. 406-437. 

14. C. Secre, Un nuovo campo di ricerche geometriche, Atti Torino 26 (1889), pp. 276-301. 

15. B. L. van DER WAERDEN, Moderne Algebra I and II, J. Springer, Berlin, 1930. 

16. B. L, van DER WAERDEN, Gruppen von lineare Transformationen, Ergebnisse der Math., 
J. Springer, Berlin, 1935. 

17. J. H. M. WeppERBURN, Non-commutative domains of integrity, Jour. fiir Math., 167 
(1932), pp. 129-141. 

18. J. H. M. WeppERBURN, Lectures on matrices, Am. Math. Soc. Colloquium Publica- 
tions, vol. XVII, New York, 1934. 


Bryn Mawr CouieGe. 














ANNALS OF MATHEMATICS 
Vol. 38, No. 2, April, 1937 


A CLASS OF NORMAL SIMPLE LIE ALGEBRAS OF 
CHARACTERISTIC ZERO’ 


By N. Jacospson? 


(Received March 24, 1937) 


Introduction. The classification of simple Lie algebras (infinitesimal groups) 
over the field of complex numbers is due to Killing and to Cartan.’ In a 
second paper’ Cartan determined the simple Lie algebras over the real field, 
The methods used in these papers are purely algebraic and so it is clear that 
the results hold equally well for arbitrary algebraically closed fields of char- 
acteristic zero and arbitrary real closed fields.. More recently W. Landherr’ 
considered this problem for arbitrary fields of characteristic zero and showed 
that it could be reduced to that of determining all normal simple algebras i.e. 
algebras which remain simple when the underlying field is extended to its 
algebraic closure. It therefore suffices to classify those Lie algebras which 
become a fixed Lie algebra in the Killing-Cartan list when the field is extended. 
Landherr has obtained a partial solution of this problem for the algebras which 
lead to Cartan’s type A.° In this paper we determine the algebras which lead 
to Cartan’s types B, C, D.’ We give an invariantive realization of these sys- 
tems and also obtain their groups of automorphisms. The relation between 
the Lie algebra and this group seems to be an algebraic substitute for the rela- 
tion between an infinitesimal group and the integrated group given by Lie’s 
theory. 


1. Anti-automorphisms in an algebra. Let % be an associative algebra with 
a finite basis over a commutative field . It is readily verified that if we 
define in %{ the new composition [a, b] = ab — ba, 2% becomes a Lie algebra ie. 


ala, b] = [aa, b] = [a, ab] [a + b, c] = [a, c] + [b, c] 
[a,b] = —[b, a] —_[[a, b], c] + [[b, c], a] + Ile, a], 6] = 0. 


Suppose % is self-reciprocal i.e. there is defined a correspondence (an anti- 
automorphism) J, a — a’ in A such that 


(a+b =a +b’ (aa) =aa’ (ab) =a’. 





1 Presented to the Society, Dec. 29, 1936. 

2 National Research Fellow. 

3 E. Cartan, Thése, Paris (1894) Chapter 5. 

‘ Les groupes réels simples et continus, Annales de l’Ecole Normale, 31 (1914) pp. 263-356. 
5 Uber einfache Liesche Ringe, Hamb. Abhandlungen 11 (1935) pp. 41-64. 

6 His classification is complete for p-adic fields. 

7 With the exception of type D and order 28. 
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We shall call a J-skew if a’ = —a. Let ©, denote the set of J-skew elements. 
If a, be S, so do a + b, aa and [a, b] so that GS, is a Lie subalgebra of %. 

A second anti-automorphism K is cogredient to J if K = S ‘JS where S 
is an automorphism of %&. In this case the elements of Gx are a*, a in G, 
(Sx = S;) and the correspondence a — a* establishes an isomorphism between 
the Lie algebras G, and Gx. 

The elements g of %{ such that gg = yl = g’g where y ¥ 0 € @ will be called 
J-orthogonal. Their totality is a group @, under multiplication. G, contains 
the invariant subgroup D consisting of the #-multiples of 1. We note that 
= (9 'Y =rG) =v 9 = 9 and (g'ag)’ = g’a’g’) ' =g"'a’g. Hence 
if ais J-skew so is g ag and so if g is a fixed element of @, the correspondence 
a— gag is an automorphism of the Lie algebra S,. If S is an automorphism 
of % then yl = g‘g”* = rrr = g°g**, K = S“JS. Thus the corre- 
spondence g — g* is an isomorphism between @, and Gx. 

Let & be an over-field of @ and £,, &,--- be a basis of @ over &. The 
elements of the algebra % = %z obtained from % by extending & to & are 
uniquely expressible in the form a@ = £a; + fa, + --- (finite sums) where 
a;e%. If M is an automorphism or anti-automorphism of %, M has a unique 
extension defined by a” = £,at" + &az’ +--- in YW. If M = J is an anti- 
automorphism and a is J-skew, 


Eat + fag +--+: = —(fiai + ba + ---) 


then a} = —a;. Thus the J-skew elements of 9% are the linear combinations 
with coefficients in @ of the J-skew elements of Y i.e. Gj, = Gyz. 

J is involutorial if J” = I the identity mapping. If K is cogredient to J 
then K is involutorial also. 


2. Skew elements of a matric algebra. In the remainder of the paper J, 
K will denote involutorial anti-automorphism (i. a. a.), % a normal simple 
algebra over ® of characteristic 0. 

Suppose first that 2%  &, the matrix algebra of order n° over &. The 
correspondence Jo: A = (ai;) > A’ = (ay) is ani.a.a. If J is any anti- 
automorphism J = SJ» where S is an automorphism and since any auto- 
morphism of a normal simple algebra is inner’ we have A’ = S 'A’S where 
A— §"AS is the automorphism S. The condition J® = J is SS’A (S"S’) = 
A for all A. Hence S'S’ = ol and S’ = oS and since S” = S, o = 1 and 
¢= +1. Thus S is either symmetric or skew-symmetric (relative to Jo). 
Thei.a.a. K = V"JV is given as AX = T'A’T where T = V’SV and AY = 





on Jo is defined by means of a fixed basis E;;, of & such that Ei; Eu = 6;.Ea: we have 
eo. K,. 
¥) at. 

*For a proof of this theorem see M. Deuring, Algebren, Springer 1935, p. 42. 


“Cf. A. A. Albert, Involutorial simple algebras and real Riemann matrices, Annals of 
Math. 36 (1935) p. 897. 
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V'AV as is readily verified. It should be noted that the matrix S associated 
with J is determined only to within a multiple in ®. Hence we have 
THeorEM 1. Thei.a.a. J and K are cogredient tf and only if the matrix § of J 
is cogredient (in the usual sense) to a multiple of the matrix T of K. 
(1) If S’ = —S, since S is non-singular n = 2y, v an integer and S is co- 


gredient to 
0 1, 
—l, 0 


(1, the identity matrix of #,)." The corresponding i. a. a. K defines the Lie 
algebra ©x consisting of the matrices of the form 


* uu me 
An Ax 








where Aije ®, and Au’ = —Axn, An’ = Ap, Aor’ = Ag = Thus Gr has 
order n(n + 1)/2 and is generated by the abelian subalgebra § of matrices 
[1 
» 
(1) H = ae 
»,} 
and 
RB Ex 0 E Ex 0 
us 0 = Ath, = 0 —~Ex 


0 Ex ++ Exi 0 0 
Ey ;44 = | nn sa 
0 0 Ex + Ex: 90 


0 Eu 0 O 
Ex; ~ E_», = 
0 O E; 90 


where 7 < k and £;; is the matrix in 4, having a 1 in the 7** row and j™ column. 
The linear forms a = +); + x, +2X; are the roots of Sx. The multiplication 
table of this algebra is 





1 See, for example J. H. M. Wedderburn, Lectures on matrices, New York, 1934, p. %. 
12 Sx is the ‘infinitesimal group” of the complex group when @ is the field of complex 
numbers. Cf. H. Weyl, Theorie der Darstellung kontinuierlicher halbeinfacher Gruppen I, 
Math. Zeitschrift 24 (1926) p. 333. 
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(H, , Hs] = 0 for any H,, H2in 9 
[H, E,] = ak, [Ea ‘ E_.] _ H. P © 


EJ = 0 if a + B is not a root 
[Ea , Es] = Nas Ea+s ~ 0 Nag an integer if a + £ is a root. 


It follows that Sx is a simple Lie algebra. For if § ¥ 0 is an ideal in Sx and 
B= H. + > tae ¥ 0 is in § then so is 











(3) [H, [H, --- (H, B]--- ]] = Li kaa’ Ea r=1,2,.--. 
for all values of \,, -:* , A». The determinant of m = 2y* rows (m, the number 
of roots) 
ay Qa *** Am 
2 2 2 
ay) Qe +++ Am 
Vi, «++, %) = = IT a; IT (a: — m) 
7 t<k 
Or Ge *** Om 
where the a; are the roots is not identically 0 and so Aj, --- , \° can be chosen 
in @ so that V(AZ, ---, 2) ¥ 0. By multiplying the relations (3) by the 


cofactors of the column containing @ in V and adding we obtain that 
VAL, +++, Me) ka Ba € 3 


and hence also kaHa. Thus if ka ¥ 0, Hae. If v > 1 every root may be 
obtained from a fixed one by adding roots and hence by the third relation (2) 
all E, belong to &. It follows that all H, ¢% and since the H,. generate 
§,3 = Sx. If» = 1, 3 D [E,, E_«] which forms a basis for § and hence 
3D E_. also. If all x2 = 0, Ho # 0 and some [Hy, E,] is a non-zero multiple 
of E,. As before it follows that ¥ = Gr. 

(2) 8’ = S. In this case S is cogredient to 


(p1 ) 
Pe 











t Pn) 
and if ® contains Vi, Sis cogredient to 1, and also to either 
; 0 1 
(5) 0 i, or j 
1, O 
1, O 


v 
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according as n = 2v + lorn = 2v."° If we use the i. a.a. corresponding to 1, 
we sce that G, is isomorphic to the Lie algebra of all skew-symmetric matrices 
in ®, and so G, has order n(n — 1)/2. For our purposes it is more convenient 
to take the i. a. a. defined by the matrices (5) and to distinguish the two cases 
odd and even order. A discussion analogous to that given in (1) may be 
made. We shall merely state the results referring the reader to Prof. Weyl’s 
paper" for the details. 
(a) n = 2y + 1. § is the subalgebra of matrices 


(0 
A 


H= ry 
= 








—), 
and there are E#,’s such that (2) holds. The roots a are +A; +); (¢ < k) and 


+r;. As before Sx is simple. 
(b) n = 2v. § is the subalgebra 


Pi 


H = 
om © 








t —r,) 


and the roots a corresponding to the Ey, are +d; +A, (i < k). The proof 
given above for the simplicity of Gx in (1) is valid here forvy > 2. Ifv =1©x 
is abelian and if » = 2 Gx is a direct sum of two algebras of type 2a in which 
y = I, 

It can be shown that Gx of type 2a with vy = 1 is isomorphic to Sx of type 1 
for vy = 1 and type 2a with vy = 2 is isomorphic to type 1 with vy = 2. No 
other algebra of type 2 is isomorphic with one of type 1.” 

In either (1) or (2) the set of matrices Gx are an absolutely irreducible 
system.” By Burnside’s theorem the enveloping algebra of Gx (the smallest 
subalgebra of ©, containing Gx) is ®, . 








18 Wedderburn, Lectures on matrices, p. 95. 
14 Loc. cit. in 12, pp. 342-345. 

15 Cartan, Thése, p. 78. 

16 Weyl II, p. 334 and p. 344. 
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3. Skew elements of a normal simple algebra. Now suppose that Y% is a 
normal simple algebra of degree n (order n’) and Q is the algebraic closure of ®. 
Then % 2 2, and % has a representation a — A by matrices in 2 such that 
the linear combinations > & A; with coefficients &; in Q of the matrices (rep- 
resenting the elements) of % comprise all the elements of @,. The i. a. a. 
J in Y induces an i.a.a. J in @, A — A’ where A’ corresponds to a’, >> &:Ai—> 
> a:Ai. The J-skew elements of ©, are the linear combinations with coeffi- 
cients in @ of the matrices of S;. Now let G be a matrix of % such that G 
is J-orthogonal in Q, i.e. GG’ = 71, 7 = 0inQ. Then GG’ is a matrix of A 
and commutes with all the matrices of %. Hence 7 = vy « © and G is the matrix 
of a J-orthogonal element of 2. 

We have seen that there is an automorphism S in Q, such that K = S“JS 
is one of the normalized i. a. a.’s defined in the last section and Gx the set of 
K-skew elements of 2, is one of the systems (1), (2a) or (2b). The corre- 
spondence a — A® is a second representation of %& by matrices in 2, and the 
j.a.a. J in Y& induces by means of this representation the i. a. a. K in Q, since 
A’ A’ = A*“, We may restrict our attention to the second representation 
and write a — A in place of a > A* and J in place of K. We shall say that 
©, has type (1), (2a) or (2b) according as Gy, is the system (1), (2a) or (2b). 
If S; has type (1) its order is n(n + 1)/2 and if it has types (2) its order is 
n(n — 1)/2. In either case since the enveloping algebras (1), (2a) or (2b) 
is 2, , the enveloping algebra of SG, is %. 

A Lie algebra 2 over ® will be called normal simple if &o where © is the alge- 
braic closure of @ is simple. This of course implies that 2 is simple. For if $ 
is an ideal in %, Xp is one in 2g. We have seen that all the algebras SG, with 
the exception of type 2 for n = 4 are normal simple. We summarize our 
results in the theorem: 

TororEM 2. Jf %& is a normal simple algebra of degree n and J an i. a. a. 
in U, Sy the set of J-skew elements is a normal simple algebra except when it has 
type2andn = 4. The order of GS; is n(n + 1)/2 (type 1) or n(n — 1)/2 (type 2). 
In either case the enveloping algebra of Sy is A. 

Corottary. If J and K are i. a.a.in a normal simple algebra and S; = Sx 
then J = K. 

For the elements a of G = G, = Gg we have a’ = —a = a*. Since the 
enveloping algebra of S is 2% any element b in % has the form ps Qi -** Gy, 
a;¢S. Hence b? = >> (—1)’a,a,. ++: a, = bX, ie. J = K. 

4. Automorphisms of the Lie algebra S,. From now on we suppose when 
€; has type 2 that n => 6 and ¥ 8. In view of the isomorphisms noted in §2 
the only loss in generality here is in the restriction n ¥ 8. 

_let G:a — a® be an automorphism 40 of S,. G is (1 — 1) since G, is 
simple and the elements mapped into 0 by an automorphism form an ideal. 
We shall show that G may be realized as a similarity transformation by a 
J-orthogonal element. For this purpose we suppose first that @ = Q is alge- 
braically closed. Then % = Q, and G,; may be taken to be one of the systems 
(1), (2a) or (2b). 
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If A is a matrix of G, the correspondence A — A® is a second representation 
of G, and since the totality of matrices {A°%} coincides with the totality {A} 
this representation is irreducible also. If, as above, H = H(Ax, --- , d,) denotes 
the general element of the maximal abelian subalgebra §, by Cartan’s theory 
of representations there exists a matrix Q in 2, such that 


Ay 


ns Q7H°Q = Ae 


where the A’s the weights of the representation are linear forms in the \’s with 
coefficients all integers in (1) and either all integers or all halves of odd integers 
in (2).” Since every A of GS, is similar to — A’, if Ao is a root of the character- 
istic equation of A so is —Ao. It follows that the negative of every weight is a 
weight. For if no A, = —A,, \; may be chosen = }§ in Q so that 


A,(At, nig , 9) * 0, # —Ax(M, ih , dy). 


Suppose Q"E2Q = F. = (¢ija) where E, is defined as in §2. Since [H, E,] = 
aE,, |(K, Fol = aFq and so agija = (Ai — AjGija. Since F, ¥ 0 it follows 
that every root a is expressible as a difference of weights. Thus the number of 
linearly independent weights is v. We order them lexicographically” and may 
suppose that A; > A, > --- >A, >O> —A, > -+- > —Ay. We recall also 
that the set of weights is invariant under arbitrary permutations of the ’s.” 

(1). In this case the weights are +A, --- , +A, and the possible differences 
~0 are tA, + Ay, p ¥ gq and +2A,. Since their number is equal to the 
number of roots all of these differences are roots. Evidently 2A; > +A, + A, > 
+2A, if r ~ 1 and so 2A; = 2), the highest root and Ay; = );. 

(2a). The weights here are 0, +A, --- , +A, and the possible differences 
are +A, + Az, +A,, +2A,. If 2A, is a root A, = 4/2 or = (Ai + \x)/2 
(¢ < k) and since the coefficients must all be halves of odd integers, v = 1 or 2 
which are cases outside of the present consideration. Thus no +24, is a root 
and so all +A, + A,, +A, are. Comparing highest linear forms we have 
Ay + Ac = i +e. If Ay — Ae = AG, Ay = (Ar + Ae + A,)/2 and in order 
that the coefficients all be halves of odd integers i ~ 1, 2 and so vy = 3,7 = 3. 
Thus As = (Ai + Ae — A3)/2 and by permuting the )’s and changing all signs 
we see that (Ai — Ae + As)/2 and (A; — Ae — A3)/2 are also weights. But this 
is impossible since there are only v = 3 positive weights. It follows that 
A; — Ae = \y + \% andi < k since Ay — Ag is positive. If k ¥ 2,7 41 for 
otherwise the coefficients of Ay = (Ai + A» + A; + As)/2 would not satisfy 
our arithmetic condition. Hence v = 4, Ay = (Ar + Ae + As + d4)/2 and 





17 Cf. Weyl, Darstellung kontinuierlicher halb-einfacher Gruppen I, Math. Zeitschr. 28 
(1925) p. 278 and Wey! II p. 334 and p. 344. a 
18 We say that A; > Aj or A; — Aj = mA + --- + md, > Oif the first mz # 0 is positive. 
19 Weyl II, p. 334 and p. 344. 
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As = (M1 + re. - A3 + hy) /2. Then A3; = (Ay — \x + As + hy) /2 and Ay = 
(hy — Ae — As \s)/2 are the remaining positive weights. But then +A, + A,, 
+A, do not exhaust all the roots and so this case is ruled out also. Hence 
k= 2%= 1 and Ay — Ae = Ay + de. Since A; + 0 Ay — Ae = Ay — Ae 
and A; = Ai. 

(2b). The weights are +A), ---, +A, and the possible differences are 
+A, + A,, +2A,. As in (2a) we may rule out the possibility that +2A, are 
roots and so all +A, + A, are. Again Ai + As = Ai + Ae. As above 
A, — Ae = A; is impossible and Ay — A, = Ay +t et ¥ 1, k ¥ 2 givesy = 4 
a case excluded by our assumption.” Hence A; — As = \; — de and again 
Ay = M1 : 

Since the highest weight A: = \; the representation by A® is similar to that 
by A.” Thus there is a fixed matrix G in Q, such that A® = G"AG for all A. 
Since A® and A are J-skew we have also A° = G’A(G’)™ and hence GG’ com- 
mutes with all A. Since the system {A} is irreducible GG’ = yl, y ¥ 0. 

We may now prove the following theorem. 

TuEeorEM 3. Jf & is anormal simple algebra, J an 1. a. a. in A and G an auto- 
morphism of the Lie algebra G, then there is a J-orthogonal element g such that 
a° = gag for all a.” 

We have seen in §3 that 2% may be represented by matrices in 2, where Q 
is the algebraic closure of ® the field of %, in such a fashion that J induces 
in 2, a normalized i. a. a. of §2 and the linear combinations with coefficients 
in 2 of the matrices of Gy, is the system (*) = (1), (2a) or (2b). The auto- 
morphism G extends to an automorphism A — A®% of (*). We have seen that 
A° = GAG where G@’ = 71, 7 ¥ 0in@. In particular for A of S, we have 
A° = @"AG. Since the enveloping algebra of the matrices of G, is the algebra 
of matrices of & we have G BG is a matrix of % if B is and so B > G BG is an 
automorphism of %{. But the automorphisms of % are inner and hence there 
is a matrix G of & such that @ “BG = G"BG. It follows that G@ = pG@ and 
since G is J-orthogonal, so is G. 

TuEorEM 4. The group of automorphisms $, of S; = G,s/D where D is the 
set of multiples 51, 6 # O in ®. 

Theorem 3 shows that @, is homomorphic to $,. It is easily seen that 
the elements of @,; which define the identity automorphism of G, are those 


of D. Hence S; & G,/D. 





** Our proof breaks down for vy = 4 since Ay = (A: + Ae + As + a)/2, Ae = (i +e — 
= M)/2, As = (Ar — Ae + As — Ay) /2 and Ay = (Ai — Ao — As + Ay) /2 are weights satisfying 
our conditions and +A, + A, give all the roots +A;+,;. The result does not hold in this 


wp * has been shown by E. Cartan: Le principe de dualité, Bull. Sci. Math. 49 (1925) 
Pp. 307, 


*! Weyl I, p. 281. 
* This theorem gives an algebraic connection between Gy and @,. If #is the field of 
complex numbers an analytic connection between these systems is known by Lie’s theory. 
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5. Isomorphism of distinct S,’s. Let 2%; and %2 be two normal simple alge- 
bras and Ji, Jzi.a.a. in %, and % respectively such that S;, > ©,,. Then 
Ss,0 = Sr and since the algebras of type 2 that we are considering are not 
isomorphic to any of type 1, ©,, and G,, have the same type and hence Y, 
and %2 have the same degree n. We have seen that % and %2 have repre- 
sentations in Q, such that the linear combinations in Q of the matrices of S,, 
or S,, comprise the elements of one of the systems (1), (2a) or (2b) denoted 
as (*). The isomorphism A; — A? defines an automorphism in (*) and hence 
by Theorem 3 A, = G@ AiG. Since the enveloping algebras of the matrices of 
©,, and S,, using linear combinations in # only are the matrices of %; and %, 
the correspondence B, — G "BiG = B for any matrix B, of %, defines an iso- 
morphism between %; and %. If we identify %; and % and repeat the argument 
we see that there is an automorphism of the matrices of YU, B — G BG sending 
the Ji-skew matrices into the J2-skew matrices. It follows from the corollary 
to Theorem 2 that J; and J2 are cogredient. 

THEOREM 5. A necessary and sufficient condition that S;, + Sy,, where J, 
and J» are i.a.a. in the normal simple algebras %, and As respectively is that 
WM, > %e. and J; and Je be cogredient. 


6. Connection with the theory of abstract Lie algebras. We have seen that 
any normal simple algebra with an i. a. a. J leads to a simple Lie algebra which 
becomes one of the systems (*) when the field is sufficiently extended. In this 
section we shall prove the converse, that any Lie algebra % that has the type (*) 
when the field is extended is isomorphic to an algebra ©, . 

Let % be a Lie algebra over ® and © the algebraic closure of &. We suppose 
that % has one of the types (*). Thus if a,, a2, +++, @m is a basis for % and 
hence also for %& there exist elements h, e. linear combinations of the a; in 2 
satisfying the relations (2). But these linear combinations involve only a finite 
number of elements of 2 and so there is an algebraic extension of finite degree 
over ® such that & = &; has the type (*). We may suppose also that % is a 
Galois field with the Galois group G@ = (i, s,t--- ). Thus & has a representa- 
tion by matrices in ®, such that the linear combination of these matrices with 
coefficients in ® constitute the system (*). The matrices A; corresponding 
to a; form a basis for (*) over ® and if [a;, a;] = Dovjear (vijn €®) then 
[A:, Aj] = Do viznAx. It is clear from the definition of the system (*) that 
if A = (4;) belongs to the system then so does A‘ = (ai,) for any se@. In 
particular A; belongs to the system, Aj, ---, A%, are linearly independent 
relative to @ and [A, A$] = )> yi;Ai since Vik = Yin €®. It follows that 
for each s ¢@ the correspondence A = >> a; A;— A® = 2) a Ai is an auto- 
morphism of the Lie algebra (*). By Theorem 3 there exists a matrix G, in ®, 
such that A* = G;'4G,. For the matrices A = ). a; A; (a; € ®) representing 
the elements of £ we have A* = A* and hence 


(6) A’ = GAG, s eG. 
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The totality of matrices satisfying the equations (6) is an algebra % over & 
containing the matrices of 2. Since the enveloping algebra of the matrices 
of & is & there are n” linearly independent matrices Ai, --- , Aj: of the form 
Ai, Ai *** Aty (is = 1,--:,m) and these belong to &%. Now suppose B = 
Dr BA; is in A. Then 


> piAi = B = G BG, = > 8:G04.G, = > BA} 


and hence 8} = B; = Bi ¢®. Thus & consists of the totality of linear sums of 
A,,-+*, Ant with coefficients in ®, or 2% is the enveloping algebra of the mat- 
rices of £ using coefficients in only. Since % = Az = &,, A is normal simple. 
Since (Aj, Aig Pre Ai)’ - (—1)*Ai, ay Ai, Ai, e Wf (is = 1,2,-::, m), Yt is 
carried into itself by the i. a. a. J defined in Y and hence J is an i. a. a. in A. 
The matrices of 2 are precisely the J-skew elements of 2%. We have therefore 
proved: 

TuroreM 6. Jf L is a Lie algebra such that %g is isomorphic to one of the 
algebras (1), (2a) or (2b) then 2 may be realized as the set of J-skew elements of a 
normal simple algebra Y%. 


7. Cogredience in a normal simple algebra. It has been shown by Albert™ 
that if 2{ is an involutorial normal simple algebra, i.e. has an i. a. a. defined 
in it then 2 = §, where § is an involutorial normal division algebra. Thus 
§ has exponent 2 and its order is a power of 2 and if ® is suitably restricted 
(e.g. an algebraic field of the rationals, a p-adic field, or the real field) then § 
is a quaternion algebra.“ We consider, however, the general case of & arbi- 
trary and shall investigate the condition that two i. a. a.’s in §, be cogredient. 
The discussion is similar to that given in the proof of Theorem 1. 

We begin with a fixed i.a.a. A = (a;;) — (Gj) = A’ where aj; § and 
a— dis ani.a.a. in §. As before an arbitrary i. a.a. J in §, has the form 
A’ = §'A’S where 8’ = +8, ie. S is a hermitian or quasi-hermitian matrix. 
If AX = T*A’T we have as before that K is cogredient to J if and only if 
TT = V'SV, re ®. Thus the question of cogredience of i. a. a. is one of ordi- 
nary cogredience of hermitian or quasi-hermitian matrices with elements in a 
normal division algebra. We hope to discuss this problem in a later paper. 


University or CHIcaGo. 





* Loc. cit. in 10, pp. 894-901. 
“These results are due to Brauer, Albert, Hasse and E. Noether. For proofs and 
references see Deuring’s Algebren. 
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EINE LIMESBEZIEHUNG BEI GEWISSER VARIATION DER 
GRADZAHL EINES POLYNOMS 


Von Erix L. PEtrrerson 


(Received September 8, 1936) 


Folgender Satz wird bewiesen: 
Satz. Gegeben sei ein Polynom f(x) der Form 
f(x) = g(x)" M(z) + N(z) 


mit festen normierten ganzzahligen Polynomen g(x) und M(x) und festem ganz- 
zahligem Polynom N(x). Ferner soll N(x) keinen Faktor mit g(x)- M(x) gemein- 
sam haben. Von den Gradzahlen der irreduziblen Faktoren von f(x), die nicht 
Teiler von einem Polynom der Form 


g(z)'-1, (¢ 21), 
sind, seir, die kleinste. Dann gilt 


limr, = ©. 


Bewels. Es sei &, eine Wurzel von f(z). Wird é, immer so gewahlt, dass 


|g(&:)|>1+e 


ist, mit festem « > 0, wenn n eine unendliche Reihe von wachsenden positiven 
ganzen Zahlen durchliuft, so muss fiir diese n gelten 


(1) Mt:,) > 0 fiirn— o. 
Aus 


| g(En) | 5 '¢ 
folgt in ahnlicher Weise 


(2) N(é,) — 0 fiir n > o. 
Nach den Beziehungen (1) und (2) und der iibrigen Méglichkeit 
| 9(&) | > 1 fir n > & 


folgt, dass die Wurzeln von f(z) fiir alle n beschrinkt sind. 

Ist die in dem Satz definierte Gradzahl r, beschrankt, so muss wenigstens ein 
irreduzibler Faktor A(x) vom konstanten Grade r fiir unendlich viele n vorkom- 
men. Fiir hinreichend grosse n ist f(x) und folglich auch A(z) normiert, und 
man kann demnach setzen: 
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A(z) = 2" + bi(n)x"™™ + be(n)x™™ + --- + b(n). 


Da die Wurzeln von A(z) beschriinkt sind, so sind die Koeffizienten b,(n), 
= 1, 2,-++,7), auch beschrinkt. Diese b,(n) sind ferner ganze Zahlne, 
und es folgt dann, dass die Anzahl der Faktoren A(x) des konstanten Grades r 
endlich ist. Es gibt daher wenigstens einen festen Faktor 


A(z) = 2° + ba +--- +5, 
der fir unendlich viele und also sicher fiir zwei verschiedene n vorkommt. 
Man erhilt also 
g(x)""M(x) + N(x) = A(x)Bi(z) 
g(x)""-M(x) + N(x) = A(z)B2(x), m > m, 
mit ganzzahligen Polynomen B,(z) und B2(z). Nach Substraktion folgt 
(3) g(x)"'M(x)(g(xz)"*" — 1) = A(x)(B2(x) — Bi(z)). 


Nach der Voraussetzung des Satzes enthalten N(x) und g(x)M(zx) keinen 
gemeinsamen Faktor, und es gilt dann auch 


A(x) } 9(x)M(z). 
Aus (3) folgt demnach 
A(z) | g(z)"""' — 1), 


im Wilderspruch zur Voraussetzung, dass A(x) nicht Teiler eines Polynoms der 
Form 


g(x)‘ — 1 
sein soll. 


Aus dem eben bewiesenen Satz folgt, dass jeder positiven ganzen Zahl v 
eine positive ganze Zahl U, entspricht derart, dass f(x), wenn 


n2U, 


ist, nur dann irreduzible Faktoren vom Grade < »v enthalten kann, wenn diese 
Faktoren gleichzeitig Faktoren von Polynomen der Form g(x)‘ — 1 sind. 
(Eine einfache Folgerung ist, dass diese Faktoren dann sogar fiir unendlich 
viele n vorkommt, nimlich fir alle n aus einer arithmetischen Reihe n = no + it, 
+= 0,1,2,...). 


Ist das Polynom 
f(x) = g(x)"M(z) + N() 


entweder irreduzibel oder .durch wenigstens einen Faktor teilbar, dessen Grad 
; beschriinkt bleibt, wenn n alle ganzen Zahlen > 0 durchlauft, und kann f(z) 
fir wenigstens hinreichend grosse n keinen Faktor mit Polynomen der Form 
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| g(x)‘ — 1 gemeinsam haben, so muss f(x) irreduzibel sein fiir alle hinreichend 

grossen n. Wird daher beinpieloweine die Theorie der Newtonschen Polygone 
nach Untersuchungen von Dumas und Ore’ auf das Polynom f(z) angewandt, 
so kann Irreduzibilitét fiir alle hinreichend grossen n festgestellt werden, 
Werden 


g(z) = x, M(x) = 1 und N(x) = ph(z) 
gesetzt mit ganzzahligem Polynom h(x), so erhalt man folgendes Beispiel: 


Satz. Im Polynom 


Th —- 


f(x) = x" + ph(z), h(0) 4 0, 
seien p eine Primzahl und wenigstens ein Koeffizient in h(x) nicht durch p teilbar, 
Dann gibt es eine feste positive Zahl U derart, dass f(x) irreduzibel ist fiir allen > U, 

Dass x” + ph(x) keinen Faktor mit einem Polynom der Form zx‘ — 1 gemein- 
sam haben kann, wird folgendermassen gezeigt. Es seien é, fiir vy = 1,2,... ,r 
die Wurzeln eines Faktors von x" + ph(x). Dann folgt 


(II é) =+p' Ul h(é,) 


v=1 v=1 


mm 5 es he 


und also 


p\Ilé 


v=1 





Sind diese ¢, auch Wurzeln von x‘ — 1, so erhalt man als Widerspruch 


(Is) - 





oa ~*~ ee oy & 






und 


STOCKHOLM. 










1 Ore: “Zur Theorie der Irreduzibilitatskriterien,”’ Math. Zeitschr. 18, S. 278-288, 
(1923). 
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ON SOME ANALYTIC SETS DEFINED BY TRANSFINITE INDUCTION 


By C. KuraTowskI AND J. v. NEUMANN 


(Received September 6, 1936) 


In recent papers C. Kuratowski’ has shown that—under very general condi- 
tions—transfinite induction applied in the domain of projective sets (in the 
sense of N. Lusin) does not lead outside of this domain. The purpose of this 
paper is to prove a theorem which gives under some supplementary assump- 
tions a more precise evaluation of the projective class of a considered set. 
Thus the set of Lebesgue” which was shown by the method of Kuratowski to 
be of class 3 will be seen to be a difference of two analytic sets (hence of class 2). 


1. Let 1, 72, ‘++ be an enumeration of the set of all rational numbers, which 
will be held fixed in what follows. Let C be the Cantor perfect set of all 


testot (ec; = 0 or 2). 


For any x eC form the set M, of all r; with c; = 2. Put = ordinal type 
of M, in its natural ordering (and not that one of the enumeration 7; , rz, --- ). 
Thus Z runs over all ordinal enumerable types, if x runs over C. Denote by 
() the set of all numbers x such that M, is a well-ordered set (i.e. @ < Q). 
Observe that « =< M, is a one-to-one correspondence of all elements of C and 
all sets of rational numbers. 


2. The following notations will be used in the definition of the set of Lebesgue. 
For any x eC put 





C1-Qn C3-9n C5-9n 
my = A SR EERE. | (n= 0,1,2,---) 

The symbol «” is defined as follows: if M, contains an r, = r, then Mm) 
is composed of all r; such that r; < r, and r; ¢ M,; if not, put z” =0. Thus, 
ifzeCy and x ¥ 0, xm) runs over all ordinal numbers less than Z. 

Let A(y), ye C, be a “universal’’ function relatively to the family of closed 
and of open subsets of the interval.2 We may assume that the plane set 
E. (2 € A(y)] is a Borel set. 

Now define for any zy with x « Cy and y ¢C the set L(z, y) as follows: 





'C. R. Paris t. 202 (1936), p. 1239 and Fund. Math. 27 (1936). 
* Journal de Mathématiques, 1905, Chap. VIII. 


* This means that X being any set belonging to the family considered, there exists a y 
such that X¥ = A(y). 
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g(x)‘ — 1 gemeinsam haben, so muss f(x) irreduzibel sein fiir alle hinreichend 
grossen n. Wird daher beispielsweise die Theorie der Newtonschen Polygone 
nach Untersuchungen von Dumas und Ore’ auf das Polynom f(z) angew andt, 
so kann Irreduzibilitit fiir alle hinreichend grossen n festgestellt werden, 
Werden 


g(x) = x, M(x) = 1 und N(az) = ph(z) 
gesetzt mit ganzzahligem Polynom h(x), so erhilt man folgendes Beispiel: 
Satz. Im Polynom 
f(x) = x" + ph(z), h(0) # 0, 


seien p eine Primzahl und wenigstens ein Koeffizient in h(x) nicht durch p teilbar, 
Dann gibt es eine feste positive Zahl U derart, dass f(x) irreduzibel ist fiir allen > U, 

Dass «" + ph(x) keinen Faktor mit einem Polynom der Form z‘ — 1 gemein- 
sam haben kann, wird folgendermassen gezeigt. Ls seien é, fiiry = 1, 2,...,r 
die Wurzeln eines Faktors von x” + ph(x). Dann folgt 


und also 


p| Il é, 


v=1 





Sind diese £, auch Wurzeln von zx‘ — 1, so erhalt man als Widerspruch 


(IIs) ~ 


STOCKHOLM. 





1 Ore: ‘Zur Theorie der Irreduzibilitatskriterien,’’ Math. Zeitschr. 18, S. 278-288, 
(1923). 
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In recent papers C. Kuratowski’ has shown that—under very general condi- 
tions—transfinite induction applied in the domain of projective sets (in the 
sense of N. Lusin) does not lead outside of this domain. The purpose of this 
paper is to prove a theorem which gives under some supplementary assump- 
tions a more precise evaluation of the projective class of a considered set. 
Thus the set of Lebesgue’ which was shown by the method of Kuratowski to 
be of class 3 will be seen to be a difference of two analytic sets (hence of class 2). 


1. Let 1, 7, *** be an enumeration of the set of all rational numbers, which 
will be held fixed in what follows. Let C be the Cantor perfect set of all 


=e+et: (c, = 0 or 2). 


For any x eC form the set M, of all r; with c; = 2. Put = ordinal type 
of M, in its natural ordering (and not that one of the enumeration 7 , r2, --- ). 
Thus Z runs over all ordinal enumerable types, if x runs over C. Denote by 
(Cy the set of all numbers x such that M, is a well-ordered set (i.e. ¢ < QQ). 
Observe that « <= M, is a one-to-one correspondence of all elements of C and 
all sets of rational numbers. 


2. The following notations will be used in the definition of the set of Lebesgue. 
For any x eC put 


a Bt ae 2. SS « ... = tad 
l= 3 a 9 + oF + (n 0, 1, 2, ) 


The symbol x” is defined as follows: if M, contains an r, = r, then Mzm 
is composed of all r; such that r; < r, and r; « M,; if not, put z” =0. Thus, 
ifzeCy and xz ¥ 0, x) runs over all ordinal numbers less than Z. 

Let A(y), yeC, be a “universal’’ function relatively to the family of closed 
and of open subsets of the interval.? We may assume that the plane set 
E,, [2 € A(y)] is a Borel set. 

Now define for any zy with x «Cy and y € C the set L(z, y) as follows: 





'C. R. Paris t. 202 (1936), p. 1239 and Fund. Math. 27 (1936). 
“ Journal de Mathématiques, 1905, Chap. VIII. 


* This means that X being any set belonging to the family considered, there exists a y 
such that x — A(y). 
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L(0, y) = Aly) 


L(z, y) = Lim sup L(z™ 


Yn) for z ~ 0°. 
The spatial set T = Eze [z « L(z, y)] is called the Lebesgue set. 


3. The preceding construction of the set 7’ may be generalized as follows. 
Replace the function 2“ by any function f,(x) such that f.(0) = 0 and for 
0 # reCo, fax) < %. Let yn be replaced by an arbitrary function g,(y) of y 
and let A(y) denote an arbitrary set-function. The operation Lim sup will be 
replaced by a general Hausdorff operation. 

Let us recall the definition of a Hausdorff operation. Let B be a set of 
sequences composed of positive integers. Denote by s = (s’, s’, --- ) a variable 
sequence.: Given a sequence of sets X;, X2, --- , form the set 


Lili xke , where s ranges over B. 
8 n 


This set is said to be obtained with the aid of the Hausdorff operation with 
base B. 

In a om the sequence s will be identified with the irrational number 

1 1 

» rs + 3 wei 
numbers. (K.g. in the case of the operation Lim sup the set B is composed 
of all numbers s containing infinitely many different s”). 

Given the functions f,(x), g.(y), A(y) and the base B, it is readily proved 
with the aid of transfinite induction that there exists one and only one function 
L(x, y) defined for all x ¢ Cy and such that 


(L(0, y) = Aly) 
Le, y) = } Il L{f.n(x), gon(y)] where s ¢ B and x # 0. 


Thus B will become a subset of the set of all irrational 


(1) 


TuEoreM. If f,(x) and g,(y) are Baire functions and B and E,, (z « A(y)] 
are analytic sets, the spatial set T = Ezz [z € L(x, y)] is an analytic set relatively 
to the part of the space composed of points xyz such that x € Co; i.e. T is the inter- 
section of an analytic set U and of the product C) X Y X Z. 


4. The following notation will be used in the proof of the theorem. 

Given a correspondence which assigns to each finite system of positive integers 
ky, ke, +++, kn (n = 0) a non-negative integer px, ... %,, denote by P the infinite 
system (p, Pi, P2,***, Pky ---k»***)- Obviously the set § of all P’s may be con- 
sidered as topologically equivalent to the space of all infinite sequences of non- 
negative integers, hence to the space of all irrational numbers. This allows 


3’ Lim sup X, = [Ivan be Xapk ; 
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us to apply theorems concerning the use of the logical quantifiers >>» and [[>' 


with P as variable: so if g(P, x) is a given relation between the variables 
P and z, the set E, > ¢(P, x) is the projection eliminating the variable P 
of the “plane” set Hz» g(P, x); hence if the last set is analytic, so is the first. 

Denote by Sk,..-%,(P) the sequence (px, ...kgt) Pky -+-ky2,°** )» Thus 
S(P) = (—i, D2, *** Ds 

It will be observed that, if ki, --- , k, is given, then py, ...;, and Sx, ...x,(P) 
are continuous functions of P. 

Finally let us write 2” = fa(x), 2°" = fig +++ fa,(2) --* ) and yn = gn(y) 
and Yk, --+ky = Jicy ( oer gx, (y) ‘ates ). If the functions fn(x) and gn(y) are Baire 
functions, the same holds true of the functions ot" *= and ae 

PROOF OF THE THEOREM. Denote by H any finite system of positive integers 
ky +++ kn (n 2 0) and put | P, H | a (Pe ; Pkyke » °** Dkykp---k,)- If n = 0 
we have | P, H | = 0 (the vacuous set), px = p and z = 2, Yip.a| = Y. 

Consider the set U of all points xyz (x e C) satisfying the following condition: 
there exists a P = (p, pi, P2, °** Pa, *** ) Such that: 


1°: p #0 
2°: if py ~ Oand z'””! = 0, then z ¢ A(yjp,x)) 


3°: if pp ~ Oand 2'””! = 0, then Sx(P) ¢ B. 
In logical symbols: 


(tyzeU) = > \(p ~ 0) I] (pax ¥ 0)(2!?*! = 0) > z€ A(yjp.m1) 


2) (px ¥ 0)(x'"""! ¥ 0) + Su(P) € B}. 

The hypotheses of the theorem lead easily to the conclusion that the set 
of points (xyzP) satisfying the condition in brackets {} is an analytic set (in 
the space C X Y X ZX $Y). Therefore the projection of that set, i.e. the 
set U, is an analytic set too. It remains to show that, assuming z e Cy) , U may 
be replaced in the preceding formula by T. 

1) Assume that (yz) ¢ T, ie. that ze L(x, y). We shall show by transfinite 
induction that there exists a P verifying the conditions 1°-3° and such that 
p = any preassigned value. 

Consider first the case = 0. Then by (1) ze A(y). Define P as follows: 
p = any number ¥ 0 and for H ¥ 0 put px = 0. The conditions 1°-3° are 
obviously satisfied. 

Now let 2 > 0 and suppose our assumption holds true for any z such that 
I<a. 

let 7 = a. Since (xyz) e T, it follows by (1) that there exists a sequence 





” 


4 . 
Do; means “there exists an z such that --- ” and []. means “for any z++-’’ Con- 


oa the properties of the quantifiers, see e.g. C. Kuratowski Topologie I, Warsaw 1933, 
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of positive integers s = (s', s’, --+) eB such 1 that, for any n, z ¢ L[f.n(z), gen(y)]. 
That means that (2°", yn, z) « J and since x” < 2, there exists a 


a eee eee 
such that: 
1: p” = 8" 
2°: if py ~ Oand2”!”"”! — 0, then z € A(yenjpn,x)) 
3°: if pp ~ Oand2™!”""! = 0, then Su(P") « B. 


Define P as follows: p = any number ~0 and for n ¥ 0 put p,,...,, = 
Pil ...z,- Then s"|P", H|=p,|P", H| = |P, nH |, Sx(P") = S,x(P) 
and it is easily seen that the conditions 1° — 3° are satisfied. 

2) Now assume that P satisfies the conditions 1° — 3°. We have to show 
that (xyz) ¢ T, ie. that ze L(z, y). 

This is obvious in the case x = 0. For, the condition 2° gives, for H = 0, 
ze A(y), hence, by (1): ze L(0, y). 

Let a > 0 and suppose our assumption is true for x’s such that % < a. 

Consider an x such that = a and a P satisfying the conditions 1° — 3°. 

Put s = (p1, p2,°:: ) ie. s = S(P). Then the conditions 1° and 3’ give, 
for H = 0, S(P)¢«B. Thus it remains to show that z ¢ L[ f(x), gen(y)]. But 
this follows easily from the fact that if P” be defined by putting pj, ...:, = 
Pnk, ---ky » then the conditions 1% — 3), are fulfilled. 

Thus the theorem is proved. It follows at once that the Lebesgue set is the 
difference of two analytic sets (the set Cy being the complement of an ana- 
lytic set).° 


5. Some generalizations. It is readily seen that the proof of the theorem is 
not affected if we assume that Y and Z denote any complete separable spaces, 
that f, and g, are Baire functions of both variables x and y, that B also depends 
upon x and y and that the set H,,, [s ¢ Bz] is analytic. 

Thus we shall have to replace (1) by 


LOO, y) = Ay) 


@) L(x, y) = Zz, ae L{fan(x, y), gen(x, y)] where s ¢ Bz,, and x # 0. 


On the right hand side of (2) we replace x and y by (z, y) and B by 


6 
Bez,y) 1P,H| »(z,y) (PHI 


This generalized theorem may be applied to some other definitions based on 
transfinite induction. 





5 See e.g. Topologie I p. 257. 


* Thus we have (z, y)" = f,(z, y), (2, y)n = gn(z, y), (x, y)™™ = fmol (2, y)", (2, y)al ete. 
For n = 0 we have (z, y)*!*"**» = zand (2, y)ky++-kn = Y 
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1) Consider first the case where Y = Z = C, where A(y) denotes a uni- 
yersal function for the open subsets of C such that the set Ey. [z « A(y)] is open 
and where x, and 2°” are defined as in §2. Put, for x Co, 


(L(0, y) = Aly) 
L(a, y) = [] L(x, yn) for odd 


= >> L(x™, yn) for even ~ 0. 


The fundamental property of the function L(x, y) is that, for any reCo, 
L(z, y) is a universal function for Borel sets of class G; and the set E,, [ze L(a, y)] 
is of that class.’ 

Now here the base B depends on x: for € odd it is composed of one sequence, 
namely of the sequence of all positive integers, and for Z even B is the set of 
all sequences of positive integers. Since the set of all z’s (belonging to C) 
such that is odd is a Borel set,* it follows easily that the set E., [s « B.] is a 
Borel set. Hence the corresponding set U is an analytic set and the set 
T = Euyz lz ¢ L(x, y)] is a difference of two analytic sets. 

2) Using the notations of the preceding example, define x*, for Z odd, so 
that the set M,+ is composed of all elements of M, except the last one. Thus 
F=2*+1. Put’ 


LO, y) = A(y) 
|L(x, y) = II L (c*, y.) for € odd, and for # even, ¥ 0, put 





L(z, y) = OL (Yen+1, Yon) Where n ranges over indices such that Jonyi < &. 


In other terms, define the functions f, and g, as follows: 

(i) if is odd, f,(a, y) = x* and g,(z, y) = Yn; 

(ii) if Z is even, fa(z, y) = Yonyi and gn(x, y) = Yen and let B,,, be defined 
for Z odd as in the preceding example and for # even suppose that it is composed 
of all sequences s such that for any n we have Jom: + 1S The set 

E [se B.,] = Il E [Gen1 + 1S 4] 
zys nm xzys 


being analytic,” the corresponding set U is analytic and T is a difference of two 
analytic sets. 


INSTITUTE FOR ADVANCED Stupy. 
settle cat ihn ail Sit 


"See e.g. Topologie I, p. 173. 

*Z may be called odd (for any z ¢ C) if it is of the form \ + 2n + 1 where is an ordinal 
type without last element. See Fund. Math. 27 op. cit. 

* The essential difference between the examples 1) and 2) is that in the second case the 
function L(2, y), for a given y, depends upon Z alone (and not upon z). See C. Kuratowski, 
C. R. Paris t. 176 (1923) p. 229 and Topologie I, p. 175. 


; rg a theorem of Lusin the set of zy such that  < gis analytic. See e.g. Topologie I, 
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GROUPS BY MATRICES 
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1. Introduction. Let A be any associative hypercomplex algebra over any 
commutative field F. If one defines in A alternants [xy] by setting 


[zy] = yx — xy 


then with respect to the (generally non-associative) “products’’ [zy], the ele- 
ments of A form a new hypercomplex algebra L(A) over F. Moreover it is 
easy to verify the well-known identities of Lie-Jacobi in L(A), namely 


[zy] + lyz] = 0 ([zylz] + [[yz]x] + [lzr]y] = 0. 


Hypercomplex algebras satisfying these identities are usually called “Lie 
algebras,” and it is clear that every module (= linear subset) in A which 
contains the alternant of any two of its members, forms a Lie algebra. 

The first main result proved below is that (Theorem 1) conversely every 
Lie algebra can be obtained in this way. 

But the result is defective in that, although L(A) is of finite order whenever 
A is, it is not proved that every Lie algebra Z of finite order can be obtained 
from an associative algebra of finite order. In other words, it is not proved 
that every such LZ can be represented isomorphically by finite matrices. This 
problem is of course by Lie’s Third Fundamental Theorem equivalent in the 
cases that F is the real resp. complex field to the classical question: is every 
Lie group nucleus G isomorphic locally’ with a group nucleus of matrices? 

The main result bearing on this problem which has been proved hitherto is 
due to Lie, and states that in case G has a discrete central, it is locally iso- 
morphic with the group of linear transformations which its inner automorphisms 
induce on its infinitesimal elements (its so-called “adjoint”? group). It 1s 


known’ that this corresponds to the statement that any Lie algebra L of finite’ 


order which possesses no element a satisfying [xa] = 0 for all x can be obtained 
from an “adjoint” associative algebra of finite order. 
The second new result proved below is the fact that (Theorem 4) in the 





1 It is known (cf. infra) that there exist Lie groups not isomorphic in the large with any 
group of finite matrices. The reader’s attention is called to a paper of I. Ado (Russian) 
in the Bull. physico-math. society of Kazan 6 (1935), which contains the theorem that 
every Lie algebra over the real or complex field is isomorphic with a Lie algebra of matrices. 

* N. Jacobson, ‘‘Rational methods in the theory of Lie algebras,’’ Annals of Math. 36 
(1935), 875-81. 
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extreme opposite case that L is nilpotent, it can be obtained from a nilpotent 
associative algebra of finite order. And using this it is proved that (Theorem 6) 
any “hypercentral” Lie group nucleus is locally isomorphic with a simply con- 
nected Lie group of matrices—whence the universal covering group of any 
“hypercentral” Lie group is topologically isomorphic with a Lie group of 
matrices. 


2. Canonical polynomials and “straightening”. Let L be any Lie algebra, 
and let us suppose that e , é2 , és, --+ constitute a basis of linearly independent 
elements of L, well-ordered in some fixed way.’ Further, let the rule for com- 
mutation in L be expressed by the equations 


[exe] = Di ct ex. 


Since we are in the realm of pure algebra, all sums considered are finite. 

Now suppose A is any associative algebra containing L and generated by 
the elements of ZL. Since A is generated by L, any element of A can be written 
as a polynomial—that is, linear combination of products — = eq) «++ eiq— 
in the e; . 

Suppose now we call a “straightening” of any polynomial, a substitution 
for one of its terms & = é;@) +++ €ig) Of the equal element 


i(j+1) (7) 
Cia + + * Ci¢jp—1y CaCj4.sy CG) CaCi42) * + * Cig) + dick Cicay + + * Ci(j—1) Ck Ci(j42) + * + CCQ 


in case 1(7 + 1) < 2(J). 

It is clear that after at most 4q(q — 1) straightenings, one can replace any 
such product £ by an equal polynomial &* + > , axé, where £* is a product 
of the same degree as £ and has the “canonical” form 


Chir) +++ CRC) [A(1) < --- < h(r)] 
and the & are products of lower degree. Hence, by induction 


Lemma 1: After being straightened, any element of A can be shown to be equal 
to a linear combination of canonical power-products of the e; . 

Observe that one can calculate the canonical polynomial equivalent to a 
given polynomial of A simply by knowing L; the nature of A is irrelevant. 
This suggests, given L, reducing symbolic polynomials in its basis-elements to 
canonical form without introducing A. But as this can be done in various 
ways, it is essential to know that 


Lemma 2: The reduction of polynomials in the e; through straightening to 
canonical form, is independent of the method of straightening. 

Proor: Let & be any product eia) «++ iq, and let ¢’(é) and ¢’(£) be the 
results of two different straightenings of ¢. Then if we can find ways of straight- 


‘niceties teae 
* It follows from the axiom of choice that any module has such a basis: one need merely 
add to a partial basis successively new linearly independent elements as long as any exist. 
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ening ¢’(~) and ¢’’(é) respectively into the same polynomial ¢’’(£), we have 
shown by induction on the degree and the number of straightenings still possible 
of the product of highest degree, that all ways of straightening £ lead to the 
same canonical polynomial 


v(o'(é)) = v@"’"(é)) = v@'"(E)). 


This is what we shall do. 
It is clear that there are essentially only two cases, which may be summarized 
in the two sets of formulas: 


( &= £1 G57 € 51 Eo Oger Cj & 
L b(t) = bo + on! = bree deegrejer ds + © ob tren acon eye 


¢’(é) a ¢” + n” = £1647 € 77 En €jr Cir Es + > Ci. : £1047 Cj 00x &. 


| E = £1€ne;; 2 

IL o'(é) = & 4 9! = brejenesé + > Ch £1e,e; ke 
o(E) = E" +n” = frenejeite + > ch fren ents. 

In Case I, ¢’ and £” can be further straightened, thus 


, an? 47 sovave 
EE +m = Srey ey Soezr eye bs + pe £16; Cy Sex £3 
k 
iA ar / cree 
EE Hm = F165 ye Ege jer ger Eg + Da £164 2.17 Cx 3. 
k 


And straightening the ee; resp. ee; terms in 7’ resp. 7”, one finds that 
/ . . a 
n + m can be straightened into the same form as 7; + 7 —giving finally 


gveger 


, / 4? eae 
"(E) =e t+mtm + > » Chr Chr’ Exe xe Exeurr Es. 


Similarly in case II, evidently h > i > j, and so we can straighten further by 
setting 


£ — frevejenge + > Ch E1esex bo 
— b1e;eseade + > Ch frexenge + > Ch Exe: 6 
me al” 

£"" —> bresenesk + > Ch Exene: ts 
£1€;€:€n 2 + > Ce f1€;en 2 + x Ch E161. 04 £2 


/ , 
=i"4+m+m. 
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Hence it is sufficient to show that 7’ +. m +m and m + 7” + no” can be 
straightened into the same form. But if one transposes exe;, exei, een, €s€k, 
and exe, Whenever they are in the wrong order, one finds that this assertion 
comes down precisely to Jacobi’s identity, which completes the proof. 


2 Corollaries of Lemmas 1-2. From Lemmas 1-2 one can deduce our first 
main result. For starting with a Lie algebra purely abstractly, taking the 
canonical polynomials in its basis-elements e; as elements of a hypercomplex 
algebra A, multiplying them symbolically and straightening the products to 
canonical form through Lemma 1, it follows‘ from Lemma 2 that multiplication 
is associative. Hence since by the first identity of Lie-Jacobi and definition, 


Cj; — ee; = Cy ex. 


TueoreM 1: Any Lie algebra L can be imbedded in a linear associative al- 
gebra A,(L). 

Corottary: Any Lie algebra is isomorphic with an algebra of infinite matrices, 
with respect to the operation of forming alternants [XY] = YX — XY. 

For if one adds to the basis of any linear associative algebra A a “principal 
unit” e satisfying ex = xe = = for all x, then the correspondence between the 
elements x of A and the linear transformations X: a — ax = X(a) of A is 
isomorphic (it is in any case homomorphic). This well-known construction 
goes back to Poincaré.’ 

Now recall that if A is any linear associative algebra in which L is embedded 
isomorphically and which is generated by L, then the correspondence between 
polynomials in the basis-elements e; of L and their values in A preserves all 
equalities true in A,(Z). Hence 


THEOREM 2: The ways of imbedding a given Lie algebra L in a linear associative 
algebra A are all obtained by taking the invariant subalgebras S of A.(L) which 
place the different linear combinations of the e; in different residue classes, and 
then forming the homomorphic images A,(L)/S. 


Thus A,(Z) is a kind of universal linear associative algebra containing L. 
Finally, we have as a corollary of Theorem 1, 


TaEorEeM 3: The free Lie algebra with n generators is isomorphic with the free 
algebra of alternants involving n symbols. 


In other words, the identities of Lie-Jacobi imply all all other identities true 
of alternants. 


The author has been informed that much of §3 was also discovered, but not 
published, by E. Artin. 


cae Se 


; Given monomials £; , & and & , one writes down £1 & purely symbolically; straight- 
ening first £1 & one gets (£1 &)& , and straightening first & £ one gets £1(t2 &). 
_ ‘L. E. Dickson, “Algebras and their arithmetics,”’ Chicago, 1922, pp. 92-98 discusses 
it more elaborately. 
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4. The case of nilpotent Lie algebras. A Lie algebra L is called nilpotent 
if and only if all its brackets of sufficient length w vanish,° and a matrix 
X = || x;; || is called triangular if and only if the elements below the principal 
diagonal vanish (i.e., 7 < j implies z;; = 0), and properly triangular if and only 
if in addition the principal diagonal consists of zeros. 

Suppose L is a nilpotent Lie algebra. Define L; = L, and Ly, [k = 2, 3, ---] 
inductively as the set of all brackets [zy] with xreL; and yeL,_;, and 
0 <i < k—that is, define L; as the set of all complex brackets of length k. 
Further, let J, [A = 1, 2, 3, ---] denote the set of linear combinations of ele- 
ments of Ly, Lnsi, Liye, «+ -—that is, of polynomials in brackets of length =h. 
Then the J; are invariant subalgebras of L, and by the hypothesis of nilpotence, 
for some finite integer w, 


Lz=zJiy>ds>dJs3>°:°->J_e=%. 


Evidently L possesses a basis of linearly independent elements ¢,, --- , ¢, 
such that 


€1,°**,€na) area basis for J» 


€1,°**,€na) area basis for J »2 


€1,°** , €n(w—1) are @ basis for J; = L. 
Moreover obviously n(1) < n(2) <---< n(w — 1) = ©. 

Now taking the particular basis e;, --- ,e,, let us form A,(Z) as in §3. 
And to each e; let us ascribe a numerical “weight”? s = s(e,), so chosen that e 
is in J, but not in J.4;. Further, to each product let us ascribe as “weight” 
the sum of the weights of its factors, and to each polynomial, the least of the 
weights of its monomial terms. Evidently (since if [e:e;] = cx’ ex , then s(k) < 
s(i) + s(j) implies cj? = 0 by our choice of basis), straightening preserves lower 
bounds to the weights of the terms of any polynomial, and multiplying a poly- 
nomial by another has the same property. 

Hence the polynomials of weight >w constitute an invariant subalgebra S 
of A,(L). 

But A.(L)/S has the canonical power-products of length <w as a (redundant) 
basis; hence it has a finite order at most r + r° + --- +r”. Since finally no 
linear combination of the e; is of weight >w, we see that the different elements 
of L lie in different residue classes of S, and so 


TuEeorEM 4: Any nilpotent Lie algebra of finite order can be imbedded in a 
(nilpotent) linear associative algebra of finite order. 


If one defines w-nilpotence as meaning that all brackets of length =w vanish, 
one gets (using Poincaré’s construction again) the somewhat more precise 





6 By the “length” of any (complex) bracket such as [[zy]z] is meant the number of letters 
in it, repeated occurrences of the same letter being counted repeatedly. 
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CoroLtLaRy: Any w-nilpotent Lie algebra of order r is isomorphic with a Lie 
algebra of finite matrices of degree at most (r°** — 1)/(r — 1), with coefficients 
in the same field. 

While if one arranges the basis-elements of A,(L)/S in order of increasing 
weight (starting with the adjoined principal unit ¢, of weight zero), one sees— 
again since multiplying by a monomial increases, and straightening never de- 
creases weight—that 


TueoreM 5: The matrices of the preceding corollary are properly triangular. 


5. Corollaries of Theorems 4-5. So far no restriction except commutativity 
has been placed on the abstract field F underlying the algebras discussed. We 
shall now specialize to the cases of the real and complex fields, and study the 
implications of Theorems 4-5 for the theory of finite continuous groups— 
assuming Lie’s Third Fundamental Theorem (but not its converse!). 

First let us term a Lie group nucleus “hypercentral” if and only if its infini- 
tesimal algebra is nilpotent.’ 


THEeorEM 6: Every hypercentral Lie group nucleus G is locally isomorphic with 
a group of matrices whose manifold is homeomorphic in the large with Cartesian 
r-space. 


Proor: By definition, the infinitesimal algebra of G is nilpotent—and so by 
Theorem 5 is isomorphic with an algebra A of properly triangular matrices. It 
follows by Lie’s Third Fundamental Theorem that G is locally isomorphic with 
the Lie group nucleus generated by these matrices. 

But these matrices are properly triangular, and so all logarithmic and expo- 
nential series’ involving them or even polynomials in them (which will also be 
properly triangular) are actually polynomial—contain only a finite number of 
non-vanishing terms. Hence by a formal identity of Hausdorff,’ exponentials 
of these matrices are a group G* in the large. Actually, ee” = e**'” where 
¢(X, Y) is a suitable polynomial in X and Y and their alternants. Again, 
since the logarithmic series is finite, and a fortiori convergent and continuous, 
the correspondence X $s e* — J is bicontinuous. Therefore canonical param- 
eters give a one-one representation in the large, and the group G* has a manifold 
homeomorphic with the linear r-space of its infinitesimal generators. 


Corotiary 1: The universal covering group of every hypercentral Lie group is 
topologically isomorphic in the large with a linear group.” 


SS 
' This is equivalent to requiring that its algebraically defined ‘‘lower central series’’ 
(using the terminology of P. Hall) terminate with the identity. 
* By e* we mean 1 + X + (1/2!)X?2 + --- , and by Log (1 + X), the limit of the series 
s~ 3X?+ 3X%— ... in case it exists. The ideas used are to be found in J. von Neumann, 
Gruppen linearen Transformationen,’’ Math. Zeits. 30 (1929), 3-42. 
°F, Hausdorff, ‘‘Die symbolische Exponentialformel in der Gruppentheorie,’’ Leipz. 
Ber. 58 (1906), Theorem B, p. 29. ; 
'* By Schreier’s well-known theorem that any two simply connected locally isomorphic 








532 GARRETT BIRKHOFF 


Corouuary 2: The converse of Lie’s Third Fundamental Theorem is valid in 
the large for nilpotent Lie algebras (with real or complex coefficients). 

Corollary 2 has been proved without restriction by E. Cartan [Fasc. 42 des 
Sci. Math. (1930), p. 18, bottom]. 


6. Linear groups with nilpotent algebras." In this section an argument of 
E. Cartan (communicated by letter) is given, which gives the topology of the 
most general linear group “of rank zero” (i.e., with nilpotent Lie algebra), 

Let L be any nilpotent Lie algebra of finite order r; then L is by §5 the 
infinitesimal algebra of a simply connected group U of strictly triangular 
matrices—in which the exponential and logarithmic functions are both single- 
valued. By Schreier’s theory of local isomorphisms one concludes that to get 
the most general Lie group having L for Lie algebra, one should take a general 
discrete subgroup & of the central 3 of U, and form 1/8. 

But since two matrices X e UW and Y el commute if and only if Log Y and 
Log X commute, 3 is connected, and even isomorphic with the translation- 
group of n-space, while & is generated by k < n linearly independent matrices 
Z,,°::,4Z,. Taking the Log Z;, we see that & is part of a k-parameter con- 
nected normal subgroup &* of 3, whence R*/R is the k-parameter torus group. 

Therefore if U/R is isomorphic with a linear group, R*/R corresponds to a 
group of diagonal matrices whose non-zero terms all have absolute value unity, 
and which commute with every X ¢« (U/8). And by a theorem proved else- 
where,” the derived subgroup WW’ of U1 generated by Poisson brackets of its 
infinitesimal generators, can have no elements in common with this. 

It follows that by taking suitable generators in 1i’/U, we can get a subgroup & 
containing U1’, such that © N K* = 0, G U K* = L. But K* being in the 
central, we see that L must be the direct sum of & (which is simply connected, 
by reference to U1) and the toroidal group. This proves 


THEOREM 7: Any group of matrices whose Lie algebra is nilpotent is a direct 
product of a toroidal group of diagonal matrices and of a simply connected Lie group. 


HARVARD UNIVERSITY. 





groups are isomorphic in the large. But there exist hypercentral Lie groups simply iso- 
morphic with no linear group even in the purely algebraic sense. Cf. the author’s note, 
Bull. Am. Math. Soc. 42 (1936), 883-888. Also, Cartan has pointed out (‘‘La topologie 
des groupes de Lie,’’ Hermann, Paris, 1936, p. 18) that the universal covering group of 
the projective group on one real variable is topologically isomorphic with no linear group 
—which controverts the conjecture that all simply connected Lie groups are isomorphic 
in the large with linear groups. 

11 Added March 4, 1937. 

12 ‘Lie groups isomorphic with no linear group,’”’ Bull. Am. Math. Soc. 42 (1936), 882-8. 
The result used is Theorem 2. 
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